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LEUCINE-RICH REPEAT-CONTAINING G-PROTEIN COUPLED 
RECEPTOR-8 MOLECULES AND USES THEREOF 

This application claims the benefit of priority from U.S. Provisional Patent 
5 Application No. 60/224,455, filed on August 10, 2000, the disclosure of which is 
explicitly incorporated by reference herein. 

Field of the Invention 

The present invention relates to Leucine-Rich Repeat-Containing G-Protein 

10 Coupled Receptor-8 (LGR8) polypeptides and nucleic acid molecules encoding the 
same. The invention also relates to selective binding agents, vectors, host cells, and 
methods for producing LGR8 polypeptides. The invention further relates to 
pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, 
and/or prevention of diseases, disorders, and conditions associated with LGR8 

15 polypeptides. 

Background of the Invention 

Technical advances in the identification, cloning, expression, and 
manipulation of nucleic acid molecules and the deciphering of the human genome 
2 0 have greatly accelerated the discovery of novel therapeutics. Rapid nucleic acid 
sequencing techniques can now generate sequence information at unprecedented rates 
and, coupled with computational analyses, allow the assembly of overlapping 
sequences into partial and entire genomes and the identification of polypeptide- 
encoding regions. A comparison of a predicted amino acid sequence against a 

2 5 database compilation of known amino acid sequences allows one to determine the 

extent of homology to previously identified sequences and/or structural landmarks. 
The cloning and expression of a polypeptide-encoding region of a nucleic acid 
molecule provides a polypeptide product for structural and functional analyses. The 
manipulation of nucleic acid molecules and encoded polypeptides may confer 

3 0 advantageous properties on a product for use as a therapeutic. 

In spite of the significant technical advances in genome research over the past 
decade, the potential for the development of novel therapeutics based on the human 
genome is still largely unrealized. Many genes encoding potentially beneficial 
polypeptide therapeutics or those encoding polypeptides, which may act as "targets" 
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The present invention further relates to four distinct LGR8 alternative splicing 
variants. The LGR8-A coding sequence consists of 18 coding exons that encode a 
large N-terminal leucine-rich repeat-containing extracellular domain, seven predicted 
transmembrane domains, and a cytoplasmic C-terminal region. The LGR8-B coding 
5 sequence is identical to the LGR8-A coding sequence with the exception that the 
LGRS-B coding sequence lacks one of the exons encoding the N-terminal 
extracellular domain. The LGR8-C coding sequence is identical to the LGR8-A 
coding sequence with the exception that the LGR8-C coding sequence lacks three of 
the exons encoding the N-terminal extracellular domain. The LGR8-D coding 

10 sequence consists of exons encoding approximately 90% of the N-terminal 
extracellular domain of the LGR8-B coding sequence, but lacks exons encoding the 
transmembrane domains and the cytoplasmic C-terminal region. Thus, LGR8-D is a 
secreted, N-terminal extracellular domain version of LGR8-B and likely functions as 
an antagonist of the LGR8 signaling pathway. LGR8-D is truncated very near the C- 

15 terminal end of the N-terminal extracellular domain by virtue of the fact that an 
additional exon, which contains stop codons, is spliced in just 5' of the exon which 
encodes the first transmembrane domain of LGR8-A, LGR8-B, and LGR8-C. It is 
likely that the N-terminal extracellular domains of LGR8-A, LGR8-B, and LGR8-C 
would be able to function as antagonists of the LGR8 signaling pathway. 

20 

The invention provides for an isolated nucleic acid molecule comprising a 
nucleotide sequence selected from the group consisting of: 

(a) the nucleotide sequence as set forth in any of SEQ ID NO: 1, SEQ ED 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID 

2 5 NO: 16, SEQ ID NO: 19, or SEQ ID NO: 22; 

(b) a nucleotide sequence encoding the polypeptide as set forth in any of 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ED NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23; 

3 0 (c) a nucleotide sequence which hybridizes under moderately or highly 

stringent conditions to the complement of either (a) or (b); and 

(d) a nucleotide sequence complementary to either (a) or (b). 
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(a) a nucleotide sequence encoding a polypeptide as set forth in any of 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23 with at least one 
5 conservative amino acid substitution, wherein the encoded polypeptide has an activity 
of the polypeptide set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, 
or SEQ ID NO: 23; 

10 (b) a nucleotide sequence encoding a polypeptide as set forth in any of 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23 with at least one 
amino acid insertion, wherein the encoded polypeptide has an activity of the 

15 polypeptide set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ED NO: 5, SEQ 
ID NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ 
ED NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or 
SEQ ED NO: 23; 

(c) a nucleotide sequence encoding a polypeptide as set forth in any of 
2 0 SEQ ED NO: 2, SEQ ED NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ID NO: 8, SEQ 

ID NO: 10, SEQ ID NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ 
ED NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23 with at least one 
amino acid deletion, wherein the encoded polypeptide has an activity of the 
polypeptide set forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ 

2 5 ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ID NO: 13, SEQ 

ID NO: 15, SEQ ID NO: 17, SEQ ED NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or 
SEQ ID NO: 23; 

(d) a nucleotide sequence encoding a polypeptide as set forth in any of 
SEQ ED NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ 

3 0 ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ 

ED NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23 which has a C- 
and/or N- terminal truncation, wherein the encoded polypeptide has an activity of the 
polypeptide set forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ 
ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ID NO: 13, SEQ 
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(c) an amino acid sequence which is at least about 70 percent identical to 
the amino acid sequence of any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ED NO: 5, 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ED NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 20, SEQ ID NO: 21, 

5 or SEQ ED NO: 23, wherein the polypeptide has an activity of the polypeptide set 
forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ 
ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ 
ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23; 

(d) a fragment of the amino acid sequence set forth in any of SEQ ED NO: 
10 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ID NO: 10, 

SEQ ED NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, 
SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23 comprising at least about 25 
amino acid residues, wherein the fragment has an activity of the polypeptide set forth 
in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED 
15 NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED 
NO: 17, SEQ ED NO: 18, SEQ ED NO: 20, SEQ ID NO: 21, or SEQ ED NO: 23, or is 
antigenic; and 

(e) an amino acid sequence for an allelic variant or splice variant of the 
amino acid sequence as set forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED 

2 0 NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ID 
NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 20, SEQ ID 
NO: 21, or SEQ ED NO: 23, or any of (a) - (c). 

The invention further provides for an isolated polypeptide comprising the 

2 5 amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence as set forth in any of SEQ ED NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ED NO: 
12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 
20, SEQ ED NO: 21, or SEQ ED NO: 23 with at least one conservative amino acid 

3 0 substitution, wherein the polypeptide has an activity of the polypeptide set forth in 

any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 
8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 
17, SEQ ED NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23; 
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The invention still further provides for an isolated polypeptide comprising the 
amino acid sequence as set forth in SEQ ID NO: 2 with at least one conservative 
amino acid substitution selected from the group consisting of: isoleucine at position 
5 26; valine at position 41; isoleucine at position 55; aspartic acid at position 78; 
aspartic acid at position 123; arginine at position 130; valine at position 135; 
methionine at position 142; leucine at position 166; tyrosine at position 167; lysine at 
position 201; valine at position 204; isoleucine at position 216; glutamatic acid at 
position 217; leucine at position 221; leucine at position 240; leucine at position 252; 

10 isoleucine at position 277; methionine at position 288; lysine at position 290; 
isoleucine at position 324; isoleucine at position 341; isoleucine at position 344; 
aspartic acid at position 350; leucine at position 376; valine at position 420; valine at 
position 425; valine at position 427; isoleucine at position 434; tyrosine at position 
442; arginine at position 444; tyrosine at position 450; isoleucine at position 466; 

15 isoleucine at position 471; leucine at position 476; phenylalanine at position 478; 
glutamatic acid at position 481; histidine at position 485; phenylalanine at position 
515; tyrosine at position 521; isoleucine at position 522; tyrosine at position 526; 
valine at position 531; valine at position 541; isoleucine at position 551; valine at 
position 552; glutamatic acid at position 561; phenylalanine at position 562; tyrosine 

2 0 at position 566; tyrosine at position 577; aspartic acid at position 579; isoleucine at 
position 597; isoleucine at position 603; valine at position 616; isoleucine at position 
621; isoleucine at position 626; lysine at position 632; leucine at position 649; 
isoleucine at position 654; valine at position 675; isoleucine at position 682; 
glutamatic acid at position 700; isoleucine at position 702; tyrosine at position 707; 

2 5 tyrosine at position 709; isoleucine at position 727; valine at position 729; methionine 

at position 737; methionine at position 745; and leucine at position 749; wherein the 
polypeptide has an activity of the polypeptide as set forth in SEQ ID NO: 2. 

Also provided are fusion polypeptides comprising LGR8 amino acid 

3 0 sequences. 

The present invention also provides for an expression vector comprising the 
isolated nucleic acid molecules as set forth herein, recombinant host cells comprising 
the recombinant nucleic acid molecules as set forth herein, and a method of producing 
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Methods of regulating expression and modulating (i.e., increasing or 
decreasing) levels of an LGR8 polypeptide are also encompassed by the invention. 
One method comprises administering to an animal a nucleic acid molecule encoding 
an LGR8 polypeptide. In another method, a nucleic acid molecule comprising 
5 elements that regulate or modulate the expression of an LGR8 polypeptide may be 
administered. Examples of these methods include gene therapy, cell therapy, and 
anti-sense therapy as further described herein. 

LGR8 polypeptides can be used for identifying ligands thereof. Various forms 
of "expression cloning" have been used for cloning ligands for receptors (See, e.g., 

10 Davis et al, 1996, Cell, 87:1161-69). These and other LGR8 ligand cloning 
experiments are described in greater detail herein. Isolation of the LGR8 ligand(s) 
allows for the identification or development of novel agonists and/or antagonists of 
the LGR8 signaling pathway. Such agonists and antagonists include LGR8 ligand(s), 
anti-LGR8 ligand antibodies and derivatives thereof, small molecules, or antisense 

15 oligonucleotides, any of which can be used for potentially treating one or more 
diseases or disorders, including those recited herein. 

Brief Description of the Figures 

Figures 1A-1D illustrate a nucleotide sequence (SEQ ID NO: 1) encoding human 

2 0 LGR8-A (SEQ ID NO: 2). The predicted signal sequence is indicated (underline) ; 

Figures 2A-2B illustrate a nucleotide sequence (SEQ ID NO: 4) encoding the N- 
terminal extracellular domain (absent the signal peptide) of human LGR8-A (SEQ ID 
NO: 5); 

25 

Figures 3A-3D illustrate a nucleotide sequence (SEQ ID NO: 6) encoding human 
LGR8-B (SEQ ID NO: 7). The predicted signal sequence is indicated (underline) ; 

Figures 4A-4B illustrate a nucleotide sequence (SEQ ID NO: 9) encoding the N- 

3 0 terminal extracellular domain (absent the signal peptide) of human LGR8-B (SEQ ED 

NO: 10); 

Figures 5A-5D illustrate a nucleotide sequence (SEQ ID NO: 11) encoding human 
LGR8-C (SEQ ID NO: 12). The predicted signal sequence is indicated (underline) ; 
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transmembrane domains, and a C-terminai cytoplasmic domain. Accordingly, LGRS- 
A, LGR8-B, and LGR8-C are useful as targets for agonistic or antagonistic molecules, 
including, but not limited to, antibodies, fusion polypeptides, carbohydrates, 
polynucleotides (such as antisense oligonucleotides), or small molecular weight 
5 organic molecules. 

Additionally, it will be understood that the N-terminal extracellular domains 
of LGRS-A, LGR8-B, and LGR8-C can be used as antagonists of the LGR8 signaling 
pathway, for example, where the N-terminal extracellular domain is fused to an Fc 
portion of an antibody. 

10 It will also be appreciated that LGR8-D is a secreted form of the N-terminal 

extracellular domain of LGR8-B. In this regard LGR8-D may act as an antagonist of 
the LGR8-B ligand(s). LGR8-D can be used as a target for antagonistic and agonistic 
molecules, including, but not limited to, antibodies, fusion polypeptides, 
carbohydrates, polynucleotides (such as antisense oligonucleotides), or small 

15 molecular weight organic molecules. For example, an antagonist specific for LGR8- 
D would inhibit the antagonistic activity of LGR8-D, thus enhancing the activity of 
LGR8-D ligand(s) and/or enhancing signaling through LGR8 receptors. Conversely 
an agonist specific for LGR8-D would enhance the antagonistic activity of LGR8-D, 
thus diminishing the activity of LGR8-D ligand(s) and/or diminishing signaling 

2 0 through LGR8. 

Definitions 

The terms "LGR8 gene" or "LGR8 nucleic acid molecule" or "LGR8 
polynucleotide" refer to a nucleic acid molecule comprising or consisting of a 

2 5 nucleotide sequence as set forth in any of SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID 

NO: 6, SEQ ED NO: 9, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID 
NO: 19, or SEQ ID NO: 22, a nucleotide sequence encoding the polypeptide as set 
forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ 

3 0 ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23, 

and nucleic acid molecules as defined herein. 

The term "LGR8 polypeptide allelic variant" refers to one of several possible 
naturally occurring alternate forms of a gene occupying a given locus on a 
chromosome of an organism or a population of organisms. 
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The term "vector" is used to refer to any molecule (e.g., nucleic acid, plasmid, 
or virus) used to transfer coding information to a host cell. 

The term "expression vector" refers to a vector that is suitable for 
transformation of a host cell and contains nucleic acid sequences that direct and/or 
5 control the expression of inserted heterologous nucleic acid sequences. Expression 
includes, but is not limited to, processes such as transcription, translation, and RNA 
splicing, if introns are present. 

The term "operably linked" is used herein to refer to an arrangement of 
flanking sequences wherein the flanking sequences so described are configured or 
10 assembled so as to perform their usual function. Thus, a flanking sequence operably 
linked to a coding sequence may be capable of effecting the replication, transcription 
and/or translation of the coding sequence. For example, a coding sequence is 
operably linked to a promoter when the promoter is capable of directing transcription 
of that coding sequence. A flanking sequence need not be contiguous with the coding 
15 sequence, so long as it functions correctly. Thus, for example, intervening 
untranslated yet transcribed sequences can be present between a promoter sequence 
and the coding sequence and the promoter sequence can still be considered "operably 
linked" to the coding sequence. 

The term "host cell" is used to refer to a cell which has been transformed, or is 
2 0 capable of being transformed with a nucleic acid sequence and then of expressing a 
selected gene of interest. The term includes the progeny of the parent cell, whether or 
not the progeny is identical in morphology or in genetic make-up to the original 
parent, so long as the selected gene is present. 

The term "LGR8 polypeptide" refers to a polypeptide comprising the amino 

2 5 acid sequence of any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 

7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 
15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID 
NO: 23 and related polypeptides. Related polypeptides include LGR8 polypeptide 
fragments, LGR8 polypeptide orthologs, LGR8 polypeptide variants, and LGRS 

3 0 polypeptide derivatives, which possess at least one activity of the polypeptide as set 

forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ 
ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23. 
LGR8 polypeptides may be mature polypeptides, as defined herein, and may or may 
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polypeptides) as compared to the LGR8 polypeptide amino acid sequence set forth in 
any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ED NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23 (with or 
5 without a leader sequence). Variants may be naturally occurring (e.g-., LGR8 
polypeptide allelic variants, LGR8 polypeptide orthologs, and LGR8 polypeptide 
splice variants) or artificially constructed. Such LGR8 polypeptide variants may be 
prepared from the corresponding nucleic acid molecules having a DNA sequence that 
varies accordingly from the DNA sequence as set forth in any of SEQ ID NO: 1, SEQ 

10 ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 1 1, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 19, or SEQ ID NO: 22. In preferred embodiments, the variants 
have from 1 to 3, or from 1 to 5, or from 1 to 10, or from 1 to 15, or from 1 to 20, or 
from 1 to 25, or from 1 to 50, or from 1 to 75, or from 1 to 100, or more than 100 
amino acid substitutions, insertions, additions and/or deletions, wherein the 

1 5 substitutions may be conservative, or non-conservative, or any combination thereof. 

The term "LGR8 polypeptide derivatives" refers to the polypeptide as set forth 
in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23, 

2 0 LGR8 polypeptide fragments, LGR8 polypeptide orthologs, or LGR8 polypeptide 
variants, as defined herein, that have been chemically modified. The term "LGR8 
polypeptide derivatives" also refers to the polypeptides encoded by LGR8 polypeptide 
allelic variants or LGR8 polypeptide splice variants, as defined herein, that have been 
chemically modified. 

2 5 The term "mature LGR8 polypeptide" refers to an LGR8 polypeptide lacking 

a leader sequence. A mature LGR8 polypeptide may also include other modifications 
such as proteolytic processing of the amino-terminus (with or without a leader 
sequence) and/or the carboxyl-terminus, cleavage of a smaller polypeptide from a 
larger precursor, N-linked and/or O-linked glycosylation, and the like. Exemplary 

3 0 mature CHL2 polypeptides are depicted by the amino acid sequences of SEQ ID NO: 

3, SEQ ID NO: 8, SEQ ID NO: 13, SEQ ID NO: 18, OR SEQ ID NO: 21. 

The term "LGR8 fusion polypeptide" refers to a fusion of one or more amino 
acids (such as a heterologous protein or peptide) at the amino- or carboxyl-terminus 
of the polypeptide as set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 
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alignments (if any) addressed by a particular mathematical model or computer 
program (i.e., "algorithms"). 

The term "similarity" is a related concept, but in contrast to "identity," 
"similarity" refers to a measure of relatedness that includes both identical matches and 
5 conservative substitution matches. If two polypeptide sequences have, for example, 
10/20 identical amino acids, and the remainder are all non-conservative substitutions, 
then the percent identity and similarity would both be 50%. If in the same example, 
there are five more positions where there are conservative substitutions, then the 
percent identity remains 50%, but the percent similarity would be 75% (15/20). 
1 0 Therefore, in cases where there are conservative substitutions, the percent similarity 
between two polypeptides will be higher than the percent identity between those two 
polypeptides. 

The term "naturally occurring" or "native" when used in connection with 
biological materials such as nucleic acid molecules, polypeptides, host cells, and the 
1 5 like, refers to materials which are found in nature and are not manipulated by man. 
Similarly, "non-naturally occurring" or "non-native" as used herein refers to a 
material that is not found in nature or that has been structurally modified or 
synthesized by man. 

The terms "effective amount" and "therapeutically effective amount" each 
2 0 refer to the amount of an LGR8 polypeptide or LGR8 nucleic acid molecule used to 
support an observable level of one or more biological activities of the LGR8 
polypeptides as set forth herein. 

The term "pharmaceutical^ acceptable carrier" or "physiologically acceptable 
carrier" as used herein refers to one or more formulation materials suitable for 

2 5 accomplishing or enhancing the delivery of the LGR8 polypeptide, LGR8 nucleic acid 

molecule, or LGR8 selective binding agent as a pharmaceutical composition. 

The term "antigen" refers to a molecule or a portion of a molecule capable of 
being bound by a selective binding agent, such as an antibody, and additionally 
capable of being used in an animal to produce antibodies capable of binding to an 

3 0 epitope of that antigen. An antigen may have one or more epitopes. 

The term "selective binding agent" refers to a molecule or molecules having 
specificity for an LGR8 polypeptide. As used herein, the terms, "specific" and 
"specificity" refer to the ability of the selective binding agents to bind to human 
LGR8 polypeptides and not to bind to human non-LGR8 polypeptides. It will be 
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residues compared to the polypeptide as set forth in any of SEQ ID NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: IS, SEQ ID NO: 
20, SEQ ID NO: 21, or SEQ ID NO: 23. Such related LGR8 polypeptides may 
5 comprise, for example, an addition and/or a deletion of one or more N-linked or O- 
linked glycosylation sites or an addition and/or a deletion of one or more cysteine 
residues. 

Related nucleic acid molecules also include fragments of LGR8 nucleic acid 
molecules which encode a polypeptide of at least about 25 contiguous amino acids, or 
10 about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or about 
150 amino acids, or about 200 amino acids, or more than 200 amino acid residues of 
the LGR8 polypeptide of any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or 

15 SEQ ID NO: 23. 

In addition, related LGR8 nucleic acid molecules also include those molecules 
which comprise nucleotide sequences which hybridize under moderately or highly 
stringent conditions as defined herein with the fully complementary sequence of the 
LGR8 nucleic acid molecule of any of SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 6, 
20 SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 19, 
or SEQ ID NO: 22, or of a molecule encoding a polypeptide, which polypeptide 
comprises the amino acid sequence as shown in any of SEQ ID NO: 2, SEQ ID NO: 
3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, 

25 SEQ ID NO: 21, or SEQ ID NO: 23, or of a nucleic acid fragment as defined herein, 
or of a nucleic acid fragment encoding a polypeptide as defined herein. Hybridization 
probes may be prepared using the LGR8 sequences provided herein to screen cDNA, 
genomic or synthetic DNA libraries for related sequences. Regions of the DNA 
and/or amino acid sequence of LGR8 polypeptide that exhibit significant identity to 

3 0 known sequences are readily determined using sequence alignment algorithms as 
described herein and those regions may be used to design probes for screening. 

The term "highly stringent conditions" refers to those conditions that are 
designed to permit hybridization of DNA strands whose sequences are highly 
complementary, and to exclude hybridization of significantly mismatched DNAs. 
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stringent conditions" are 0.015 M sodium chloride, 0.0015 M sodium citrate at 50- 
65°C or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 20% formamide at 
37-50°C. By way of example, "moderately stringent conditions" of 50°C in 0.015 M 
sodium ion will allow about a 21% mismatch. 
5 It will be appreciated by those skilled in the art that there is no absolute 

distinction between "highly stringent conditions" and "moderately stringent 
conditions." For example, at 0.015 M sodium ion (no formamide), the melting 
temperature of perfectly matched long DNA is about 71°C. With a wash at 65°C (at 
the same ionic strength), this would allow for approximately a 6% mismatch. To 
1 0 capture more distantly related sequences, one skilled in the art can simply lower the 
temperature or raise the ionic strength. 

A good estimate of the melting temperature in 1M NaCl* for oligonucleotide 
probes up to about 20nt is given by: 

Tm = 2°C per A-T base pair + 4°C per G-C base pair 
15 *The sodium ion concentration in 6X salt sodium citrate (SSC) is 1M. See Suggs et 
aL, Developmental Biology Using Purified Genes 683 (Brown and Fox, eds., 1981). 

High stringency washing conditions for oligonucleotides are usually at a 
temperature of 0-5°C below the Tm of the oligonucleotide in 6X SSC, 0.1% SDS. 

In another embodiment, related nucleic acid molecules comprise or consist of 
2 0 a nucleotide sequence that is at least about 70 percent identical to the nucleotide 
sequence as shown in any of SEQ ID NO: 1, SEQ ID NO; 4, SEQ ID NO: 6, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 19, or SEQ 
ID NO: 22, or comprise or consist essentially of a nucleotide sequence encoding a 
polypeptide that is at least about 70 percent identical to the polypeptide as set forth in 

2 5 any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, 

SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23. In preferred 
embodiments, the nucleotide sequences are about 75 percent, or about 80 percent, or 
about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical 

3 0 to the nucleotide sequence as shown in any of SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID 

NO: 6, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID 
NO: 19, or SEQ ID NO: 22, or the nucleotide sequences encode a polypeptide that is 
about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or 
about 95, 96, 97, 98, or 99 percent identical to the polypeptide sequence as set forth in 
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Conservative amino acid substitutions also encompass non-naturally occurring 
amino acid residues that are typically incorporated by chemical peptide synthesis 
rather than by synthesis in biological systems. These include peptidomimetics, and 
other reversed or inverted forms of amino acid moieties. 
5 Naturally occurring residues may be divided into classes based on common 

side chain properties: 

1) hydrophobic: norleucine, Met, Ala, Val, Leu, lie; 

2) neutral hydrophilic: Cys, Ser, Thr; 

3) acidic: Asp, Glu; 

10 4) basic: Asn 5 Gin, His, Lys, Arg; 

5) residues that influence chain orientation: Gly, Pro; and 

6) aromatic: Trp, Tyr, Phe. 

For example, non-conservative substitutions may involve the exchange of a 

member of one of these classes for a member from another class. Such substituted 
15 residues may be introduced into regions of the human LGR8 polypeptide that are 

homologous with non-human LGR8 polypeptides, or into the non-homologous 

regions of the molecule. 

In making such changes, the hydropathic index of amino acids may be 

considered. Each amino acid has been assigned a hydropathic index on the basis of its 
2 0 hydrophobicity and charge characteristics. The hydropathic indices are: isoleucine 

(+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); 

methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 

tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 

glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

2 5 The importance of the hydropathic amino acid index in conferring interactive 

biological function on a protein is generally understood in the art (Kyte et al, 1982, J. 
MoL Biol. 157:105-31). It is known that certain amino acids may be substituted for 
other amino acids having a similar hydropathic index or score and still retain a similar 
biological activity. In making changes based upon the hydropathic index, the 

3 0 substitution of amino acids whose hydropathic indices are within ±2 is preferred, 

those which are within ±1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be 
made effectively on the basis of hydrophilicity, particularly where the biologically 
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A skilled artisan will be able to determine suitable variants of the polypeptide 
as set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, 
SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, 
5 SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 
23 using well-known techniques. For identifying suitable areas of the molecule that 
may be changed without destroying biological activity, one skilled in the art may 
target areas not believed to be important for activity. For example, when similar 
polypeptides with similar activities from the same species or from other species are 

1 0 known, one skilled in the art may compare the amino acid sequence of an LGR8 
polypeptide to such similar polypeptides. With such a comparison, one can identify 
residues and portions of the molecules that are conserved among similar polypeptides. 
It will be appreciated that changes in areas of the LGR8 molecule that are not 
conserved relative to such similar polypeptides would be less likely to adversely 

1 5 affect the biological activity and/or structure of an LGR8 polypeptide. One skilled in 
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computer programs are currently available to assist with predicting secondary 
structure. One method of predicting secondary structure is based upon homology 
modeling. For example, two polypeptides or proteins that have a sequence identity of 
greater than 30%, or similarity greater than 40%, often have similar structural 
5 topologies. The recent growth of the protein structural database (PDB) has provided 
enhanced predictability of secondary structure, including the potential number of 
folds within the structure of a polypeptide or protein. See Holm et aL, 1999, Nucleic 
Acids Res. 27:244-47. It has been suggested that there are a limited number of folds 
in a given polypeptide or protein and that once a critical number of structures have 

1 0 been resolved, structural prediction will become dramatically more accurate (Brenner 
etaL, 1997, Curr. Opin. Struct Biol 7:369-76). 

Additional methods of predicting secondary structure include "threading" 
(Jones, 1997, Curr. Opin. Struct. Biol Sippl et aL, 1996, Structure 4:15- 

19), "profile analysis" (Bowie et aL, 1991, Science, 253:164-70; Gribskov et aL, 

15 1990, Methods EnzymoL 183:146-59; Gribskov et aL, 1987, Proc. Nat. Acad. Sci. 
U.S.A. 84:4355-58), and "evolutionary linkage" (See Holm et aL, supra, and Brenner 
et aL, supra). 

Preferred LGR8 polypeptide variants include glycosylation variants wherein 
the number and/or type of glycosylation sites have been altered compared to the 
2 0 amino acid sequence set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, 
or SEQ ED NO: 23. In one embodiment, LGR8 polypeptide variants comprise a 
greater or a lesser number of N-linked glycosylation sites than the amino acid 

2 5 sequence set forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ 
ID NO: 23. An N-linked glycosylation site is characterized by the sequence: Asn-X- 
Ser or Asn-X-Thr, wherein the amino acid residue designated as X may be any amino 

3 0 acid residue except proline. The substitution of amino acid residues to create this 

sequence provides a potential new site for the addition of an N-linked carbohydrate 
chain. Alternatively, substitutions that eliminate this sequence will remove an 
existing N-linked carbohydrate chain. Also provided is a rearrangement of N-linked 
carbohydrate chains wherein one or more N-linked glycosylation sites (typically those 
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SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ 
ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ 
ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23. Related nucleic acid molecules also 
comprise or consist of a nucleotide sequence encoding a polypeptide as set forth in 
5 any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 
8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 
17, SEQ ED NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23 with at 
least one modification selected from the group consisting of amino acid substitutions, 
amino acid insertions, amino acid deletions, carboxyl-terminal truncations, and 
10 amino-terminal truncations and wherein the polypeptide has an activity of the 
polypeptide set forth in any of SEQ ED NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or 
SEQ ID NO: 23. 

15 In addition, the polypeptide comprising the amino acid sequence of any of 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23, or other LGR8 
polypeptide, may be fused to a homologous polypeptide to form a homodimer or to a 

2 0 heterologous polypeptide to form a heterodimer. Heterologous peptides and 
polypeptides include, but are not limited to: an epitope to allow for the detection 
and/or isolation of an LGR8 fusion polypeptide; a transmembrane receptor protein or 
a portion thereof, such as an extracellular domain or a transmembrane and 
intracellular domain; a ligand or a portion thereof which binds to a transmembrane 

2 5 receptor protein; an enzyme or portion thereof which is catalytically active; a 

polypeptide or peptide which promotes oligomerization, such as a leucine zipper 
domain; a polypeptide or peptide which increases stability, such as an 
immunoglobulin constant region; and a polypeptide which has a therapeutic activity 
different from the polypeptide comprising the amino acid sequence as set forth in any 

3 0 of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, 

SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23, or other 
LGR8 polypeptide. 
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In one example, a human IgG hinge, CH2, and CH3 region may be fused at 
either the ammo-terminus or carboxyl-terminus of the LGR8 polypeptides using 
methods known to the skilled artisan. In another example, a human IgG hinge, CH2, 
5 and CH3 region may be fused at either the ammo-terminus or carboxyl-terminus of an 
LGR8 polypeptide fragment (e.g., the predicted extracellular portion of LGR8 
polypeptide). 

The resulting LGR8 fusion polypeptide may be purified by use of a Protein A 
affinity column. Peptides and proteins fused to an Fc region have been found to 
10 exhibit a substantially greater half-life in vivo than the unfused counterpart. Also, a 
fusion to an Fc region allows for dimerization/multimerization of the fusion 
polypeptide. The Fc region may be a naturally occurring Fc region, or may be altered 
to improve certain qualities, such as therapeutic qualities, circulation time, or reduced 
aggregation. 

1 5 Identity and similarity of related nucleic acid molecules and polypeptides are 

readily calculated by known methods. Such methods include, but are not limited to 
those described in Computational Molecular Biology (A.M. Lesk, ed., Oxford 
University Press 1988); Biocomputing: Informatics and Genome Projects (D.W. 
Smith, ed., Academic Press 1993); Computer Analysis of Sequence Data (Part 1, 

2 0 A.M. Griffin and H.G. Griffin, eds., Humana Press 1994); G. von Heinle, Sequence 
Analysis in Molecular Biology (Academic Press 1987); Sequence Analysis Primer (M. 
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Algorithm: Needleman and Wunsch, 1970,7. MoL Biol 48:443-53; 
Comparison matrix: BLOSUM 62 (Henikoff et aL, snpra)\ 
Gap Penalty: 12 
5 Gap Length Penalty: 4 

Threshold of Similarity: 0 

The GAP program is useful with the above parameters. The aforementioned 
parameters are the default parameters for polypeptide comparisons (along with no 
1 0 penalty for end gaps) using the GAP algorithm. 

Preferred parameters for nucleic acid molecule sequence comparison include 
the following: 

Algorithm: Needleman and Wunsch, supra; 
15 Comparison matrix: matches = +10, mismatch = 0 

Gap Penalty: 50 
Gap Length Penalty: 3 

The GAP program is also useful with the above parameters. The aforementioned 
2 0 parameters are the default parameters for nucleic acid molecule comparisons. 

Other exemplary algorithms, gap opening penalties, gap extension penalties, 
comparison matrices, and thresholds of similarity may be used, including those set 
forth in the Program Manual, Wisconsin Package, Version 9, September, 1997. The 
particular choices to be made will be apparent to those of skill in the art and will 
2 5 depend on the specific comparison to be made, such as DNA-to-DNA, protein-to- 
protein, protein-to-DNA; and additionally, whether the comparison is between given 
pairs of sequences (in which case GAP or BestFit are generally preferred) or between 
one sequence and a large database of sequences (in which case FASTA or BLASTA 
are preferred). 

30 

Nucleic Acid Molecules 

The nucleic acid molecules encoding a polypeptide comprising the amino acid 
sequence of an LGR8 polypeptide can readily be obtained in a variety of ways 
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introducing the expression vector into an appropriate host, the encoded LGR8 
polypeptide may be produced in large amounts. 

Another method for obtaining a suitable nucleic acid sequence is the 
polymerase chain reaction (PCR). In this method, cDNA is prepared from 
5 poly(A)4-RNA or total RNA using the enzyme reverse transcriptase. Two primers, 
typically complementary to two separate regions of cDNA encoding the amino acid 
sequence of an LGR8 polypeptide, are then added to the cDNA along with a 
polymerase such as Tag polymerase, and the polymerase amplifies the cDNA region 
between the two primers. 

10 Another means of preparing a nucleic acid molecule encoding the amino acid 

sequence of an LGR8 polypeptide is chemical synthesis using methods well known to 
the skilled artisan such as those described by Engels et al, 1989, Angew. Chem. Intl. 
Ed. 28:716-34. These methods include, inter alia, the phosphotriester, 
phosphoramidite, and H-phosphonate methods for nucleic acid synthesis. A preferred 

15 method for such chemical synthesis is polymer-supported synthesis using standard 
phosphoramidite chemistry. Typically, the DNA encoding the amino acid sequence 
of an LGR8 polypeptide will be several hundred nucleotides in length. Nucleic acids 
larger than about 100 nucleotides can be synthesized as several fragments using these 
methods. The fragments can then be ligated together to form the full-length 

2 0 nucleotide sequence of an LGR8 gene. Usually, the DNA fragment encoding the 
amino-terminus of the polypeptide will have an ATG, which encodes a methionine 
residue. This methionine may or may not be present on the mature form of the LGR8 
polypeptide, depending on whether the polypeptide produced in the host cell is 
designed to be secreted from that cell. Other methods known to the skilled artisan 

2 5 may be used as well. 

In certain embodiments, nucleic acid variants contain codons which have been 
altered for optimal expression of an LGR8 polypeptide in a given host cell. Particular 
codon alterations will depend upon the LGR8 polypeptide and host cell selected for 
expression. Such "codon optimization" can be carried out by a variety of methods, 

3 0 for example, by selecting codons which are preferred for use in highly expressed 

genes in a given host cell. Computer algorithms which incorporate codon frequency 
tables such as "Eco_high.Cod" for codon preference of highly expressed bacterial 
genes may be used and are provided by the University of Wisconsin Package Version 
9.0 (Genetics Computer Group, Madison, WI). Other useful codon frequency tables 
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to be expressed, and a selectable marker element. Each of these sequences is 
discussed below. 

Optionally, the vector may contain a "tag"-encoding sequence, i.e., an 
oligonucleotide molecule located at the 5' or 3 5 end of the LGR8 polypeptide coding 
5 sequence; the oligonucleotide sequence encodes poIyHis (such as hexaHis), or 
another "tag" such as FLAG, HA (hemaglutinin influenza virus), or myc for which 
commercially available antibodies exist. This tag is typically fused to the polypeptide 
upon expression of the polypeptide, and can serve as a means for affinity purification 
of the LGR8 polypeptide from the host cell. Affinity purification can be 
1 0 accomplished, for example, by column chromatography using antibodies against the 
tag as an affinity matrix. Optionally, the tag can subsequently be removed from the 
purified LGR8 polypeptide by various means such as using certain peptidases for 
cleavage. 

Flanking sequences may be homologous (i.e., from the same species and/or 
15 strain as the host cell), heterologous (i.e., from a species other than the host cell 
species or strain), hybrid (i.e., a combination of flanking sequences from more than 
one source), or synthetic, or the flanking sequences may be native sequences that 
normally function to regulate LGR8 polypeptide expression. As such, the source of a 
flanking sequence may be any prokaryotic or eukaryotic organism, any vertebrate or 
2 0 invertebrate organism, or any plant, provided that the flanking sequence is functional 
in, and can be activated by, the host cell machinery. 

Flanking sequences useful in the vectors of this invention may be obtained by 
any of several methods well known in the art. Typically, flanking sequences useful 
herein - other than the LGR8 gene flanking sequences - will have been previously 

2 5 identified by mapping and/or by restriction endonuclease digestion and can thus be 

isolated from the proper tissue source using the appropriate restriction endonucleases. 
In some cases, the full nucleotide sequence of a flanking sequence may be known. 
Here, the flanking sequence may be synthesized using the methods described herein 
for nucleic acid synthesis or cloning. 

3 0 Where all or only a portion of the flanking sequence is known, it may be 

obtained using PCR and/or by screening a genomic library with a suitable 
oligonucleotide and/or flanking sequence fragment from the same or another species. 
Where the flanking sequence is not known, a fragment of DNA containing a flanking 
sequence may be isolated from a larger piece of DNA that may contain, for example, 
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Other selection genes may be used to amplify the gene that will be expressed. 
Amplification is the process wherein genes that are in greater demand for the 
production of a protein critical for growth are reiterated in tandem within the 
chromosomes of successive generations of recombinant cells. Examples of suitable 
5 selectable markers for mammalian cells include dihydrofolate reductase (DHFR) and 
thymidine kinase. The mammalian cell transformants are placed under selection 
pressure wherein only the transformants are uniquely adapted to survive by virtue of 
the selection gene present in the vector. Selection pressure is imposed by culturing 
the transformed cells under conditions in which the concentration of selection agent in 

1 0 the medium is successively changed, thereby leading to the amplification of both the 
selection gene and the DNA that encodes an LGR8 polypeptide. As a result, 
increased quantities of LGR8 polypeptide are synthesized from the amplified DNA. 

A ribosome binding site is usually necessary for translation initiation of 
mRNA and is characterized by a Shine-Dalgarno sequence (prokaryotes) or a Kozak 

15 sequence (eukaiyotes). The element is typically located 3' to the promoter and 5' to 
the coding sequence of an LGR8 polypeptide to be expressed. The Shine-Dalgarno 
sequence is varied but is typically a polypurine (i.e., having a high A-G content). 
Many Shine-Dalgarno sequences have been identified, each of which can be readily 
synthesized using methods set forth herein and used in a prokaryotic vector. 

2 0 A leader, or signal, sequence may be used to direct an LGR8 polypeptide out 

of the host cell. Typically, a nucleotide sequence encoding the signal sequence is 
positioned in the coding region of an LGR8 nucleic acid molecule, or directly at the 
5' end of an LGR8 polypeptide coding region. Many signal sequences have been 
identified, and any of those that are functional in the selected host cell may be used in 

2 5 conjunction with an LGR8 nucleic acid molecule. Therefore, a signal sequence may 

be homologous (naturally occurring) or heterologous to the LGR8 nucleic acid 
molecule. Additionally, a signal sequence may be chemically synthesized using 
methods described herein. In most cases, the secretion of an LGR8 polypeptide from 
the host cell via the presence of a signal peptide will result in the removal of the 

3 0 signal peptide from the secreted LGR8 polypeptide. The signal sequence may be a 

component of the vector, or it may be a part of an LGR8 nucleic acid molecule that is 
inserted into the vector. 

Included within the scope of this invention is the use of either a nucleotide 
sequence encoding a native LGR8 polypeptide signal sequence joined to an LGR8 
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not interrupt the coding sequence. Any intron from any source, including viral, 
prokaryotic and eukaryotic (plant or animal) organisms, may be used to practice this 
invention, provided that it is compatible with the host cell into which it is inserted. 
Also included herein are synthetic introns. Optionally, more than one intron may be 
5 used in the vector. 

The expression and cloning vectors of the present invention will typically 
contain a promoter that is recognized by the host organism and operably linked to the 
molecule encoding the LGR8 polypeptide. Promoters are untranscribed sequences 
located upstream (i.e., 5') to the start codon of a structural gene (generally within 

10 about 100 to 1000 bp) that control the transcription of the structural gene. Promoters 
are conventionally grouped into one of two classes: inducible promoters and 
constitutive promoters. Inducible promoters initiate increased levels of transcription 
from DNA under their control in response to some change in culture conditions, such 
as the presence or absence of a nutrient or a change in temperature. Constitutive 

1 5 promoters, on the other hand, initiate continual gene product production; that is, there 
is little or no control over gene expression. A large number of promoters, recognized 
by a variety of potential host cells, are well known. A suitable promoter is operably 
linked to the DNA encoding LGR8 polypeptide by removing the promoter from the 
source DNA by restriction enzyme digestion and insert :ng the desired promoter 

2 0 sequence into the vector. The native LGR8 promoter sequence may be used to direct 
amplification and/or expression of an LGR8 nucleic acid molecule. A heterologous 
promoter is preferred, however, if it permits greater transcription and higher yields of 
the expressed protein as compared to the native promoter, and if it is compatible with 
the host cell system that has been selected for use. 

2 5 Promoters suitable for use with prokaryotic hosts include the beta-lactamase 

and lactose promoter systems; alkaline phosphatase; a tryptophan (trp) promoter 
system; and hybrid promoters such as the tac promoter. Other known bacterial 
promoters are also suitable. Their sequences have been published, thereby enabling 
one skilled in the art to Iigate them to the desired DNA sequence, using linkers or 

3 0 adapters as needed to supply any useful restriction sites. 

Suitable promoters for use with yeast hosts are also well known in the art. 
Yeast enhancers are advantageously used with yeast promoters. Suitable promoters 
for use with mammalian host cells are well known and include, but are not limited to, 
those obtained from the genomes of viruses such as polyoma virus, fowlpox virus, 
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gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et aL, 1986, Science 234:1372-78). 

An enhancer sequence may be inserted into the vector to increase the 
transcription of a DNA encoding an LGR8 polypeptide of the present invention by 
5 higher eukaryotes. Enhancers are cis-acting elements of DNA, usually about 10-300 
bp in length, that act on the promoter to increase transcription. Enhancers are 
relatively orientation and position independent. They have been found 5* and 3 ? to 
the transcription unit. Several enhancer sequences available from mammalian genes 
are known (e.g., globin, elastase, albumin, alpha- feto-protein and insulin). Typically, 

10 however, an enhancer from a virus will be used. The SV40 enhancer, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer, and adenovirus 
enhancers are exemplary enhancing elements for the activation of eukaryotic 
promoters. While an enhancer may be spliced into the vector at a position 5' or 3' to 
an LGR8 nucleic acid molecule, it is typically located at a site 5' from the promoter. 

1 5 Expression vectors of the invention may be constructed from a starting vector 

such as a commercially available vector. Such vectors may or may not contain all of 
the desired flanking sequences. Where one or more of the flanking sequences 
described herein are not already present in the vector, they may be individually 
obtained and ligated into the vector. Methods used for obtaining each of the flanking 

2 0 sequences are well known to one skilled in the art. 

Preferred vectors for practicing this invention are those that are compatible 
with bacterial, insect, and mammalian host cells. Such vectors include, inter alia, 
pCRII, pCR3, and pcDNA3.1 (Invitrogen, San Diego, CA), pBSII (Stratagene, La 
Jolla, CA), pET15 (Novagen, Madison, WI), pGEX (Pharmacia Biotech, Piscataway, 

2 5 NJ), pEGFP-N2 (Clontech, Palo Alto, CA), pETL (BlueBacH, Invitrogen), pDSR- 

alpha (PCT Pub. No. WO 90/14363) and pFastBacDual (Gibco-BRL, Grand Island, 
NY). 

Additional suitable vectors include, but are not limited to, cosmids, plasmids, 
or modified viruses, but it will be appreciated that the vector system must be 

3 0 compatible with the selected host cell. Such vectors include, but are not limited to 

plasmids such as Bluescript® plasmid derivatives (a high copy number ColEl -based 
phagemid; Stratagene Cloning Systems, La Jolla CA), PCR cloning plasmids 
designed for cloning Taq-amplified PCR products (e.g., TOPO™ TA Cloning® Kit 
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may contain a dominantly acting selection gene. Other suitable mammalian cell lines 
include but are not limited to, mouse neuroblastoma N2A cells, HeLa, mouse L-929 
cells, 3T3 lines derived from Swiss, Balb-c or NIH mice, BHK or HaK hamster cell 
lines. Each of these cell lines is known by and available to those skilled in the art of 
5 protein expression. 

Similarly useful as host cells suitable for the present invention are bacterial 
cells. For example, the various strains of E. coli (e.g., HB101, DH5ot, DH10, and 
MCI 061) are well-known as host cells in the field of biotechnology. Various strains 
of B. subtilis^ Psendomonas spp., other Bacillus spp., Streptomyces spp., and the like 
1 0 may also be employed in this method. 

Many strains of yeast cells known to those skilled in the art are also available 
as host cells for the expression of the polypeptides of the present invention. Preferred 
yeast cells include, for example, Saccharomyces cerivisae and Pichia pastoris. 

Additionally, where desired, insect cell systems may be utilized in the 
15 methods of the present invention. Such systems are described, for example, in Kitts 
et al 9 1993, Biotechniques, 14:810-17; Lucklow, 1993, Curr. Opin. Biotechnol 
4:564-72; and Lucklow et aL, 1993, J. Virol, 67:4566-79. Preferred insect cells are 
Sf-9 and Hi5 (Invitrogen). 

One may also use transgenic animals to express glycosylated LGR8 
2 0 polypeptides. For example, one may use a transgenic milk-producing animal (a cow 
or goat, for example) and obtain the present glycosylated polypeptide in the animal 
milk. One may also use plants to produce LGR8 polypeptides, however, in general, 
the glycosylation occurring in plants is different from that produced in mammalian 
cells, and may result in a glycosylated product which is not suitable for human 

2 5 therapeutic use. 

Polypeptide Production 

Host cells comprising an LGR8 polypeptide expression vector may be 
cultured using standard media well known to the skilled artisan. The media will 

3 0 usually contain all nutrients necessary for the growth and survival of the cells. 

Suitable media for culturing E. coli cells include, for example, Luria Broth (LB) 
and/or Terrific Broth (TB). Suitable media for culturing eukaryotic cells include 
Roswell Park Memorial Institute medium 1640 (RPMI 1640), Minimal Essential 
Medium (MEM) and/or Dulbecco's Modified Eagle Medium (DMEM), all of which 
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LGR8 polypeptide can then be analyzed using gel electrophoresis, 
immunoprecipitation, or the like. If it is desired to isolate the LGR8 polypeptide, 
isolation may be accomplished using standard methods such as those described herein 
and in Marston et aL, 1990, Meth. E?tz., 182:264-75. 
5 In some cases, an LGR8 polypeptide may not be biologically active upon 

isolation. Various methods for "refolding" or converting the polypeptide to its 
tertiary structure and generating disulfide linkages can be used to restore biological 
activity. Such methods include exposing the solubilized polypeptide to a pH usually 
above 7 and in the presence of a particular concentration of a chaotrope. The 

10 selection of chaotrope is very similar to the choices used for inclusion body 
solubilization, but usually the chaotrope is used at a lower concentration and is not 
necessarily the same as chaotropes used for the solubilization. In most cases the 
refolding/oxidation solution will also contain a reducing agent or the reducing agent 
plus its oxidized form in a specific ratio to generate a particular redox potential 

15 allowing for disulfide shuffling to occur in the formation of the protein's cysteine 
bridges. Some of the commonly used redox couples include cysteine/cystamine, 
glutathione (GSH)/dithiobis GSH, cupric chloride, dithiothreitol(DTT)/dithiane DTT, 
and 2-2-mercaptoethanol(bME)/dithio-b(ME). In many instances, a cosolvent may be 
used or may be needed to increase the efficiency of the refolding, and the more 

2 0 common reagents used for this purpose include glycerol, polyethylene glycol of 
various molecular weights, arginine and the like. 

If inclusion bodies are not formed to a significant degree upon expression of 
an LGR8 polypeptide, then the polypeptide will be found primarily in the supernatant 
after centrifugation of the cell homogenate. The polypeptide may be further isolated 

2 5 from the supernatant using methods such as those described herein. 

The purification of an LGR8 polypeptide from solution can be accomplished 
using a variety of techniques. If the polypeptide has been synthesized such that it 
contains a tag such as Hexahistidine (LGR8 polypeptide/hexaHis) or other small 
peptide such as FLAG (Eastman Kodak Co., New Haven, CT) or myc (Invitrogen) at 

3 0 either its carboxyl- or amino-terminus, it may be purified in a one-step process by 

passing the solution through an affinity column where the column matrix has a high 
affinity for the tag. 

For example, polyhistidine binds with great affinity and specificity to nickel. 
Thus, an affinity column of nickel (such as the Qiagen® nickel columns) can be used 
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an mRNA and its encoded peptide. See also, Roberts, 1999, Curr. Opin. Chem. Biol. 
3:268-73. Additionally, U.S. Patent No. 5,824,469 describes methods for obtaining 
oligonucleotides capable of carrying out a specific biological function. The procedure 
involves generating a heterogeneous pool of oligonucleotides, each having a 5' 
5 randomized sequence, a central preselected sequence, and a 3' randomized sequence. 
The resulting heterogeneous pool is introduced into a population of cells that do not 
exhibit the desired biological function. Subpopulations of the cells are then screened 
for those that exhibit a predetermined biological function. From that subpopulation, 
oligonucleotides capable of carrying out the desired biological function are isolated. 

10 U.S. Patent Nos. 5,763,192; 5,814,476; 5,723,323; and 5,817,483 describe 

processes for producing peptides or polypeptides. This is done by producing 
stochastic genes or fragments thereof, and then introducing these genes into host cells 
which produce one or more proteins encoded by the stochastic genes. The host cells 
are then screened to identify those clones producing peptides or polypeptides having 

1 5 the desired activity. 

Another method for producing peptides or polypeptides is described in 
PCT/US98/20094 (WO99/15650) filed by Athersys, Inc. Known as "Random 
Activation of Gene Expression for Gene Discovery" (RAGE-GD), the process 
involves the activation of endogenous gene expression or over-expression of a gene 

2 0 by in situ recombination methods. For example, expression of an endogenous gene is 
activated or increased by integrating a regulatory sequence into the target cell that is 
capable of activating expression of the gene by non-homologous or illegitimate 
recombination. The target DNA is first subjected to radiation, and a genetic promoter 
inserted. The promoter eventually locates a break at the front of a gene, initiating 

2 5 transcription of the gene. This results in expression of the desired peptide or 

polypeptide. 

It will be appreciated that these methods can also be used to create 
comprehensive LGR8 polypeptide expression libraries, which can subsequently be 
used for high throughput phenotypic screening in a variety of assays, such as 

3 0 biochemical assays, cellular assays, and whole organism assays (e.g., plant, mouse, 

etc.). 

Synthesis 
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monoclonal antibodies include the hybridoma methods of Kohler et al, 1975, Nature 
256:495-97 and the human B-cell hybridoma method (Kozbor, 1984, J. Immunol 
133:3001; Brodeur et al, Monoclonal Antibody Production Techniques and 
Applications 51-63 (Marcel Dekker, Inc., 1987). Also provided by the invention are 
5 hybridoma cell lines that produce monoclonal antibodies reactive with LGR8 
polypeptides. 

Monoclonal antibodies of the invention may be modified for use as 
therapeutics. One embodiment is a "chimeric" antibody in which a portion of the 
heavy (H) and/or light (L) chain is identical with or homologous to a corresponding 

10 sequence in antibodies derived from a particular species or belonging to a particular 
antibody class or subclass, while the remainder of the chain(s) is/ are identical with or 
homologous to a corresponding sequence in antibodies derived from another species 
or belonging to another antibody class or subclass. Also included are fragments of 
such antibodies, so long as they exhibit the desired biological activity. See U.S. 

15 Patent No. 4,816,567; Morrison et al, 1985, Proc. Natl Acad, Sci. 81:6851-55. 

In another embodiment, a monoclonal antibody of the invention is a 
"humanized" antibody. Methods for humanizing non-human antibodies are well 
known in the art. See U.S. Patent Nos. 5,585,089 and 5,693,762. Generally, a 
humanized antibody has one or more amino acid residues introduced into it from a 

2 0 source that is non-human. Humanization can be performed, for example, using 
methods described in the art (Jones et al, 1986, Nature 321 : 522-25; Riechmann et al, 
1998, Nature 332:323-27; Verhoeyen et al, 1988, Science 239:1534-36), by 
substituting at least a portion of a rodent complementarity-determining region for the 
corresponding regions of a human antibody. 

2 5 Also encompassed by the invention are human antibodies that bind LGR8 

polypeptides. Using transgenic animals {e.g., mice) that are capable of producing a 
repertoire of human antibodies in the absence of endogenous immunoglobulin 
production such antibodies are produced by immunization with an LGR8 polypeptide 
antigen {i.e., having at least 6 contiguous amino acids), optionally conjugated to a 

30 carrier. See, e.g., Jakobovits et al, 1993, Proc. Natl Acad. Sci. 90:2551-55; 
Jakobovits et al, 1993, Nature 362:255-58; Bruggermann et al, 1993, Year in 
Immuno. 7:33. In one method, such transgenic animals are produced by 
incapacitating the endogenous loci encoding the heavy and light immunoglobulin 
chains therein, and inserting loci encoding human heavy and light chain proteins into 
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the detectable moiety may be a radioisotope, such as 3 H, 14 C, 32 P, 35 S, l25 I, ?9 Tc, 11 'in, 
or 67 Ga; a fluorescent or chemiluminescent compound, such as fluorescein 
isothiocyanate, rhodamine, or luciferin; or an enzyme, such as alkaline phosphatase. 
(3-galactosidase, or horseradish peroxidase (Bayer, et aL, 1990, Meth. Enz. 184:138- 
5 63). 

Competitive binding assays rely on the ability of a labeled standard (e.g., an 
LGR8 polypeptide, or an immunologically reactive portion thereof) to compete with 
the test sample analyte (an LGR8 polypeptide) for binding with a limited amount of 
anti-LGR8 antibody. The amount of an LGR8 polypeptide in the test sample is 

10 inversely proportional to the amount of standard that becomes bound to the 
antibodies. To facilitate determining the amount of standard that becomes bound, the 
antibodies typically are insolubilized before or after the competition, so that the 
standard and analyte that are bound to the antibodies may conveniently be separated 
from the standard and analyte that remain unbound. 

15 Sandwich assays typically involve the use of two antibodies, each capable of 

binding to a different immunogenic portion, or epitope, of the protein to be detected 
and/or quantitated. In a sandwich assay, the test sample analyte is typically bound by 
a first antibody that is immobilized on a solid support, and thereafter a second 
antibody binds to the analyte, thus forming an insoluble three-part complex. See, e.g., 

2 0 U.S. Patent No. 4,376,110. The second antibody may itself be labeled with a 
detectable moiety (direct sandwich assays) or may be measured using an anti- 
immunoglobulin antibody that is labeled with a detectable moiety (indirect sandwich 
assays). For example, one type of sandwich assay is an enzyme-linked 
immunosorbent assay (ELISA), in which case the detectable moiety is an enzyme. 

2 5 The selective binding agents, including anti-LGR8 antibodies, are also useful 

for in vivo imaging. An antibody labeled with a detectable moiety may be 
administered to an animal, preferably into the bloodstream, and the presence and 
location of the labeled antibody in the host assayed. The antibody may be labeled 
with any moiety that is detectable in an animal, whether by nuclear magnetic 

3 0 resonance, radiology, or other detection means known in the art. 

Selective binding agents of the invention, including antibodies, may be used as 
therapeutics. These therapeutic agents are generally agonists or antagonists, in that 
they either enhance or reduce, respectively, at least one of the biological activities of 
an LGR8 polypeptide. In one embodiment, antagonist antibodies of the invention are 
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of populations and generation of surrogate markers for clinical trials; and enhancing 
related LGR8 polypeptide small molecule drug discovery by aiding in the 
identification of selective compounds in high throughput screens. 

5 Chemical Derivatives 

Chemically modified derivatives of LGR8 polypeptides may be prepared by 
one skilled in the art, given the disclosures described herein. LGR8 polypeptide 
derivatives are modified in a manner that is different - either in the type or location of 
the molecules naturally attached to the polypeptide. Derivatives may include 

10 molecules formed by the deletion of one or more naturally-attached chemical groups. 
The polypeptide comprising the amino acid sequence of any of SEQ ID NO: 2, SEQ 
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID 
NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23, or other LGR8 polypeptide, may be 

15 modified by the covalent attachment of one or more polymers. For example, the 
polymer selected is typically water-soluble so that the protein to which it is attached 
does not precipitate in an aqueous environment, such as a physiological environment. 
Included within the scope of suitable polymers is a mixture of polymers. Preferably, 
for therapeutic use of the end-product preparation, the polymer will be 

2 0 pharmaceutically acceptable. 

The polymers each may be of any molecular weight and may be branched or 
unbranched. The polymers each typically have an average molecular weight of 
between about 2 kDa to about 100 kDa (the term "about" indicating that in 
preparations of a water-soluble polymer, some molecules will weigh more, some less, 

2 5 than the stated molecular weight). The average molecular weight of each polymer is 

preferably between about 5 kDa and about 50 kDa, more preferably between about 12 
kDa and about 40 kDa and most preferably between about 20 kDa and about 35 kDa. 

Suitable water-soluble polymers or mixtures thereof include, but are not 
limited to, N-linked or O-linked carbohydrates, sugars, phosphates, polyethylene 

3 0 glycol (PEG) (including the forms of PEG that have been used to derivatize proteins, 

including mono-(C]-Cio), alkoxy-, or aryloxy-poly ethylene glycol), monomethoxy- 
polyethylene glycol, dextran (such as low molecular weight dextran of, for example, 
about 6 kD), cellulose, or other carbohydrate based polymers, poly-(N-vinyl 
pyrrolidone) polyethylene glycol, propylene glycol homopolymers, polypropylene 
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(TNP) and the resulting conjugates precipitated with anti-DNP or anti-TNP-IgM to 
form decameric conjugates with a valency of 10. 

Generally, conditions that may be alleviated or modulated by the 
administration of the present LGR8 polypeptide derivatives include those described 
5 herein for LGR8 polypeptides. However, the LGR8 polypeptide derivatives disclosed 
herein may have additional activities, enhanced or reduced biological activity, or 
other characteristics, such as increased or decreased half-life, as compared to the non- 
derivatized molecules. 

10 Genetically Engineered Non-Human Animals 

Additionally included within the scope of the present invention are non-human 
animals such as mice, rats, or other rodents; rabbits, goats, sheep, or other farm 
animals, in which the genes encoding native LGR8 polypeptide have been disrupted 
(i.e., "knocked out") such that the level of expression of LGR8 polypeptide is 

15 significantly decreased or completely abolished. Such animals may be prepared using 
techniques and methods such as those described in U.S. Patent No. 5,557,032. 

The present invention further includes non-human animals such as mice, rats, 
or other rodents; rabbits, goats, sheep, or other farm animals, in which either the 
native form of an LGR8 gene for that animal or a heterologous LGR8 gene is over- 

2 0 expressed by the animal, thereby creating a "transgenic" animal. Such transgenic 
animals may be prepared using well known methods such as those described in U.S. 
Patent No 5,489,743 and PCT Pub. No. WO 94/28122. 

The present invention further includes non-human animals in which the 
promoter for one or more of the LGR8 polypeptides of the present invention is either 

2 5 activated or inactivated (e.g. , by using homologous recombination methods) to alter 

the level of expression of one or more of the native LGR8 polypeptides. 

These non-human animals may be used for drug candidate screening. In such 
screening, the impact of a drug candidate on the animal may be measured. For 
example, drug candidates may decrease or increase the expression of the LGR8 gene. 

3 0 In certain embodiments, the amount of LGR8 polypeptide that is produced may be 

measured after the exposure of the animal to the drug candidate. Additionally, in 
certain embodiments, one may detect the actual impact of the drug candidate on the 
animal. For example, over-expression of a particular gene may result in, or be 
associated with, a disease or pathological condition. In such cases, one may test a 
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sequences which direct or control the expression of LGR8 polypeptide, and which act 
as anti-sense regulators of expression. 

Once a test molecule has been identified as interacting with an LGRS 
polypeptide, the molecule may be further evaluated for its ability to increase or 
5 decrease LGR8 polypeptide activity. The measurement of the interaction of a test 
molecule with LGR8 polypeptide may be carried out in several formats, including 
cell-based binding assays, membrane binding assays, solution-phase assays, and 
immunoassays. In general, a test molecule is incubated with an LGR8 polypeptide 
for a specified period of time, and LGRS polypeptide activity is determined by one or 

1 0 more assays for measuring biological activity. 

The interaction of test molecules with LGR8 polypeptides may also be 
assayed directly using polyclonal or monoclonal antibodies in an immunoassay. 
Alternatively, modified forms of LGR8 polypeptides containing epitope tags as 
described herein may be used in solution and immunoassays. 

15 In the event that LGR8 polypeptides display biological activity through an 

interaction with a binding partner (e.g., a receptor or a ligand), a variety of in vitro 
assays may be used to measure the binding of an LGRS polypeptide to the 
corresponding binding partner (such as a selective binding agent, receptor, or ligand). 
These assays may be used to screen test molecules for their ability to increase or 

2 0 decrease the rate and/or the extent of binding of an LGR8 polypeptide to its binding 
partner. In one assay, an LGRS polypeptide is immobilized in the wells of a 
microtiter plate. Radiolabeled LGR8 polypeptide binding partner (for example, 
iodinated LGR8 polypeptide binding partner) and a test molecule can then be added 
either one at a time (in either order) or simultaneously to the wells. After incubation, 

2 5 the wells can be washed and counted for radioactivity, using a scintillation counter, to 

determine the extent to which the binding partner bound to the LGR8 polypeptide. 
Typically, a molecule will be tested over a range of concentrations, and a series of 
control wells lacking one or more elements of the test assays can be used for accuracy 
in the evaluation of the results. An alternative to this method involves reversing the 

3 0 "positions" of the proteins, i.e., immobilizing LGR8 polypeptide binding partner to 

the microtiter plate wells, incubating with the test molecule and radiolabeled LGRS 
polypeptide, and determining the extent of LGR8 polypeptide binding. See, e.g., 
Current Protocols in Molecular Biology, chap. 18 (Ausubel et al, eds., Green 
Publishers Inc. and Wiley and Sons 1995). 
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an LGR8 polypeptide and an LGR8 polypeptide binding partner. In these cases, the 
assays set forth herein can be readily modified by adding such additional test 
compound(s) either simultaneously with, or subsequent to, the first test compound. 
The remainder of the steps in the assay are as set forth herein. 
5 In vitro assays such as those described herein may be used advantageously to 

screen large numbers of compounds for an effect on the formation of a complex 
between an LGR8 polypeptide and LGR8 polypeptide binding partner. The assays 
may be automated to screen compounds generated in phage display, synthetic peptide, 
and chemical synthesis libraries. 

1 o Compounds which increase or decrease the formation of a complex between 

an LGR8 polypeptide and an LGR8 polypeptide binding partner may also be screened 
in cell culture using cells and cell lines expressing either LGR8 polypeptide or LGR8 
polypeptide binding partner. Cells and cell lines may be obtained from any mammal, 
but preferably will be from human or other primate, canine, or rodent sources. The 
15 binding of an LGR8 polypeptide to cells expressing LGR8 polypeptide binding 
partner at the surface is evaluated in the presence or absence of test molecules, and 
the extent of binding may be determined by, for example, flow cytometry using a 
biotinylated antibody to an LGR8 polypeptide binding partner. Cell culture assays 
can be used advantageously to further evaluate compounds that score positive in 

2 0 protein binding assays described herein. 

Cell cultures can also be used to screen the impact of a drug candidate. For 
example, drug candidates may decrease or increase the expression of the LGR8 gene. 
In certain embodiments, the amount of LGR8 polypeptide or an LGR8 polypeptide 
fragment that is produced may be measured after exposure of the cell culture to the 

2 5 drug candidate. In certain embodiments, one may detect the actual impact of the drug 

candidate on the cell culture. For example, the over-expression of a particular gene 
may have a particular impact on the cell culture. In such cases, one may test a drug 
candidate's ability to increase or decrease the expression of the gene or its ability to 
prevent or inhibit a particular impact on the cell culture. In other examples, the 

3 0 production of a particular metabolic product such as a fragment of a polypeptide, may 

result in, or be associated with, a disease or pathological condition. In such cases, one 
may test a drug candidate's ability to decrease the production of such a metabolic 
product in a cell culture. 
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In accordance with certain embodiments of the invention, it may be useful to 
be able to determine the source of a certain cell type associated with an LGRS 
polypeptide. For example, it may be useful to determine the origin of a disease or 
pathological condition as an aid in selecting an appropriate therapy. In certain 
5 embodiments, nucleic acids encoding an LGR8 polypeptide can be used as a probe to 
identify cells described herein by screening the nucleic acids of the cells with such a 
probe. In other embodiments, one may use anti-LGR8 polypeptide antibodies to test 
for the presence of LGR8 polypeptide in cells, and thus, determine if such cells are of 
the types described herein. 

10 

LGR8 Polypeptide Compositions and Administration 

Therapeutic compositions are within the scope of the present invention. Such 
LGR8 polypeptide pharmaceutical compositions may comprise a therapeutically 
effective amount of an LGRS polypeptide or an LGR8 nucleic acid molecule in 

15 admixture with a pharmaceutically or physiologically acceptable formulation agent 
selected for suitability with the mode of administration. Pharmaceutical compositions 
may comprise a therapeutically effective amount of one or more LGR8 polypeptide 
selective binding agents in admixture with a pharmaceutically or physiologically 
acceptable formulation agent selected for suitability with the mode of administration. 

2 0 Acceptable formulation materials preferably are nontoxic to recipients at the 

dosages and concentrations employed. 

The pharmaceutical composition may contain formulation materials for 
modifying, maintaining, or preserving, for example, the pH, osmolality, viscosity, 
clarity, color, isotonicity, odor, sterility, stability, rate of dissolution or release, 

2 5 adsorption, or penetration of the composition. Suitable formulation materials include, 

but are not limited to, amino acids (such as glycine, glutamine, asparagine, arginine, 
or lysine), antimicrobials, antioxidants (such as ascorbic acid, sodium sulfite, or 
sodium hydrogen-sulfite), buffers (such as borate, bicarbonate, Tris-HCl, citrates, 
phosphates, or other organic acids), bulking agents (such as mannitol or glycine), 

3 0 chelating agents (such as ethylenediamine tetraacetic acid (EDTA)), complexing 

agents (such as caffeine, polyvinylpyrrolidone, beta-cyclodextrin, or hydroxypropyl- 
beta-cyclodextrin), fillers, monosaccharides, disaccharides, and other carbohydrates 
(such as glucose, mannose, or dextrins), proteins (such as serum albumin, gelatin, or 
immunoglobulins), coloring, flavoring and diluting agents, emulsifying agents, 
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pharmaceutically acceptable compositions is within the skill of the art. 

The formulation components are present in concentrations that are acceptable 
to the site of administration. For example, buffers are used to maintain the 
composition at physiological pH or at a slightly lower pH, typically within a pH range 
5 of from about 5 to about 8. 

When parenteral administration is contemplated, the therapeutic compositions 
for use in this invention may be in the form of a pyrogen- free, parenterally acceptable, 
aqueous solution comprising the desired LGR8 molecule in a pharmaceutically 
acceptable vehicle. A particularly suitable vehicle for parenteral injection is sterile 

10 distilled water in which an LGR8 molecule is formulated as a sterile, isotonic 
solution, properly preserved. Yet another preparation can involve the formulation of 
the desired molecule with an agent, such as injectable microspheres, bio-erodible 
particles, polymeric compounds (such as polylactic acid or polyglycolic acid), beads, 
or liposomes, that provides for the controlled or sustained release of the product 

15 which may then be delivered via a depot injection. Hyaluronic acid may also be used, 
and this may have the effect of promoting sustained duration in the circulation. Other 
suitable means for the introduction of the desired molecule include implantable drug 
delivery devices. 

In one embodiment, a pharmaceutical composition may be formulated for 
2 0 inhalation. For example, LGR8 polypeptide may be formulated as a dry powder for 
inhalation. LGR8 polypeptide or nucleic acid molecule inhalation solutions may also 
be formulated with a propellant for aerosol delivery. In yet another embodiment, 
solutions may be nebulized. Pulmonary administration is further described in PCT 
Pub. No. WO 94/20069, which describes the pulmonary delivery of chemically 

2 5 modified proteins. 

It is also contemplated that certain formulations may be administered orally. 
In one embodiment of the present invention, LGR8 polypeptides that are administered 
in this fashion can be formulated with or without those carriers customarily used in 
the compounding of solid dosage forms such as tablets and capsules. For example, a 

3 0 capsule may be designed to release the active portion of the formulation at the point in 

the gastrointestinal tract when bioavailability is maximized and pre-systemic 
degradation is minimized. Additional agents can be included to facilitate absorption 
of the LGR8 polypeptide. Diluents, flavorings, low melting point waxes, vegetable 
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lyophilized form or in a solution. In addition, parenteral compositions generally are 
placed into a container having a sterile access port, for example, an intravenous 
solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

Once the pharmaceutical composition has been formulated, it may be stored in 
5 sterile vials as a solution, suspension, gel, emulsion, solid, or as a dehydrated or 
lyophilized powder. Such formulations may be stored either in a ready-to-use form or 
in a form (e.g., lyophilized) requiring reconstitution prior to administration. 

In a specific embodiment, the present invention is directed to kits for 
producing a single-dose administration unit. The kits may each contain both a first 

1 0 container having a dried protein and a second container having an aqueous 
formulation. Also included within the scope of this invention are kits containing 
single and multi-chambered pre-filled syringes (e.g., liquid syringes and lyosyringes). 

The effective amount of an LGR8 pharmaceutical composition to be employed 
therapeutically will depend, for example, upon the therapeutic context and objectives. 

15 One skilled in the art will appreciate that the appropriate dosage levels for treatment 
will thus vary depending, in part, upon the molecule delivered, the indication for 
which the LGR8 molecule is being used, the route of administration, and the size 
(body weight, body surface, or organ size) and condition (the age and general health) 
of the patient. Accordingly, the clinician may titer the dosage and modify the route of 

2 0 administration to obtain the optimal therapeutic effect. A typical dosage may range 
from about 0.1 fig/kg to up to about 100 mg/kg or more, depending on the factors 
mentioned above. In other embodiments, the dosage may range from 0. 1 |J.g/kg up to 
about 100 mg/kg; or 1 p-g/kg up to about 100 mg/kg; or 5 \xg/kg up to about 100 
mg/kg. 

2 5 The frequency of dosing will depend upon the pharmacokinetic parameters of 

the LGR8 molecule in the formulation being used. Typically, a clinician will 
administer the composition until a dosage is reached that achieves the desired effect. 
The composition may therefore be administered as a single dose, as two or more 
doses (which may or may not contain the same amount of the desired molecule) over 

3 0 time, or as a continuous infusion via an implantation device or catheter. Further 

refinement of the appropriate dosage is routinely made by those of ordinary skill in 
the art and is within the ambit of tasks routinely performed by them. Appropriate 
dosages may be ascertained through use of appropriate dose-response data. 



-69- 



W O 02/14489 



PCT/US01/25054 



polypeptides. 

Homologous recombination is a technique originally developed for targeting 
genes to induce or correct mutations in transcriptionally active genes. Kucherlapati, 
1989, Prog, in Nucl Acid Res. & Mol Biol 36:301. The basic technique was 
5 developed as a method for introducing specific mutations into specific regions of the 
mammalian genome (Thomas et al, 1986, Cell 44:419-28; Thomas and Capecchi, 
1987, Cell 51:503-12; Doetschman et al, 1988, Proc. Natl Acad. Set U.S.A. 85:8583- 
87) or to correct specific mutations within defective genes (Doetschman et ah, 1987, 
Nature 330:576-78). Exemplary homologous recombination techniques are described 

10 in U.S. Patent No. 5,272,071; European Patent Nos. 9193051 and 505500; 
PCT/US90/07642, and PCT Pub No. WO 91/09955). 

Through homologous recombination, the DNA sequence to be inserted into the 
genome can be directed to a specific region of the gene of interest by attaching it to 
targeting DNA. The targeting DNA is a nucleotide sequence that is complementary 

15 (homologous) to a region of the genomic DNA. Small pieces of targeting DNA that 
are complementary to a specific region of the genome are put in contact with the 
parental strand during the DNA replication process. It is a general property of DNA 
that has been inserted into a cell to hybridize, and therefore, recombine with other 
pieces of endogenous DNA through shared homologous regions. If this 

2 0 complementary strand is attached to an oligonucleotide that contains a mutation or a 
different sequence or an additional nucleotide, it too is incorporated into the newly 
synthesized strand as a result of the recombination. As a result of the proofreading 
function, it is possible for the new sequence of DNA to serve as the template. Thus, 
the transferred DNA is incorporated into the genome. 

2 5 Attached to these pieces of targeting DNA are regions of DNA that may 

interact with or control the expression of an LGR8 polypeptide, e.g., flanking 
sequences. For example, a promoter/enhancer element, a suppressor, or an exogenous 
transcription modulatory element is inserted in the genome of the intended host cell in 
proximity and orientation sufficient to influence the transcription of DNA encoding 

3 0 the desired LGR8 polypeptide. The control element controls a portion of the DNA 

present in the host cell genome. Thus, the expression of the desired LGR8 
polypeptide may be achieved not by transfection of DNA that encodes the LGR8 gene 
itself, but rather by the use of targeting DNA (containing regions of homology with 
the endogenous gene of interest) coupled with DNA regulatory segments that provide 
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properly positioned in this plasmid, would integrate in such a manner as to create a 
new or modified transcriptional unit resulting in de novo or increased LGR8 
polypeptide production from the cell's endogenous LGR8 gene. 

A further method to use the ceil line in which the site specific recombination 
5 sequence had been placed just upstream of the cell's endogenous genomic LGRS 
polypeptide coding region is to use homologous recombination to introduce a second 
recombination site elsewhere in the cell line's genome. The appropriate recombinase 
enzyme is then introduced into the two-recombination-site cell line, causing a 
recombination event (deletion, inversion, and translocation) (Sauer, 1994, Cwt. Opin. 

10 BiotechnoL, 5:521-27; Sauer, 1993, Methods Enzymol, 225:890-900) that would 
create a new or modified transcriptional unit resulting in de novo or increased LGR8 
polypeptide production from the cell's endogenous LGR8 gene. 

An additional approach for increasing, or causing, the expression of LGRS 
polypeptide from a cell's endogenous LGR8 gene involves increasing, or causing, the 

15 expression of a gene or genes (e.g., transcription factors) and/or decreasing the 
expression of a gene or genes (e.g., transcriptional repressors) in a manner which 
results in de novo or increased LGR8 polypeptide production from the cell's 
endogenous LGR8 gene. This method includes the introduction of a non-naturally 
occurring polypeptide (e.g., a polypeptide comprising a site specific DNA binding 

2 0 domain fused to a transcriptional factor domain) into the cell such that de novo or 
increased LGR8 polypeptide production from the cell's endogenous LGRS gene 
results. 

The present invention further relates to DNA constructs useful in the method 
of altering expression of a target gene. In certain embodiments, the exemplary DNA 

2 5 constructs comprise: (a) one or more targeting sequences, (b) a regulatory sequence, 

(c) an exon, and (d) an unpaired splice-donor site. The targeting sequence in the DNA 
construct directs the integration of elements (a) - (d) into a target gene in a cell such 
that the elements (b) - (d) are operatively linked to sequences of the endogenous target 
gene. In another embodiment, the DNA constructs comprise: (a) one or more 

3 0 targeting sequences, (b) a regulatory sequence, (c) an exon, (d) a splice-donor site, (e) 

an intron, and (f) a splice-acceptor site, wherein the targeting sequence directs the 
integration of elements (a) - (f) such that the elements of (b) - (f) are operatively 
linked to the endogenous gene. The targeting sequence is homologous to the 
preselected site in the cellular chromosomal DNA with which homologous 
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Alternatively, the patient's own cells, transformed to produce LGR8 polypeptides ex 
vivo, may be implanted directly into the patient without such encapsulation. 

Techniques for the encapsulation of living cells are known in the art, and the 
preparation of the encapsulated cells and their implantation in patients may be 
5 routinely accomplished. For example, Baetge et ah (PCT Pub. No, WO 95/05452 and 
PCT/US94/09299) describe membrane capsules containing genetically engineered 
cells for the effective delivery of biologically active molecules. The capsules are 
biocompatible and are easily retrievable. The capsules encapsulate cells transfected 
with recombinant DNA molecules comprising DNA sequences coding for biologically 

10 active molecules operatively linked to promoters that are not subject to down- 
regulation in vivo upon implantation into a mammalian host. The devices provide for 
the delivery of the molecules from living ceils to specific sites within a recipient. In 
addition, see U.S. Patent Nos. 4,892,538; 5,011,472; and 5,106,627. A system for 
encapsulating living cells is described in PCT Pub. No. WO 91/10425 (Aebischer et 

15 ah). See also, PCT Pub. No. WO 91/10470 (Aebischer et ah); Winn et ah, 1991, 
Exper. Neurol. 113:322-29; Aebischer et ah, 1991, Exper. Neurol 111:269-75; and 
Tresco et ah, 1992, ASAIO 38:17-23. 

In vivo and in vitro gene therapy delivery of LGR8 polypeptides is also 
envisioned. One example of a gene therapy technique is to use the LGR8 gene (either 

2 0 genomic DNA, cDNA, and/or synthetic DNA) encoding an LGR8 polypeptide that 
may be operably linked to a constitutive or inducible promoter to form a "gene 
therapy DNA construct." The promoter may be homologous or heterologous to the 
endogenous LGR8 gene, provided that it is active in the cell or tissue type into which 
the construct will be inserted. Other components of the gene therapy DNA construct 

2 5 may optionally include DNA molecules designed for site-specific integration (e.g., 

endogenous sequences useful for homologous recombination), tissue-specific 
promoters, enhancers or silencers, DNA molecules capable of providing a selective 
advantage over the parent cell, DNA molecules useful as labels to identify 
transformed cells, negative selection systems, cell specific binding agents (as, for 

3 0 example, for cell targeting), cell-specific internalization factors, transcription factors 

enhancing expression from a vector, and factors enabling vector production. 

A gene therapy DNA construct can then be introduced into cells (either ex vivo 
or in vivo) using viral or non-viral vectors. One means for introducing the gene 
therapy DNA construct is by means of viral vectors as described herein. Certain 
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ecdysone-responsive gene). The ecdysone receptor includes a transactivation domain, 
DNA-binding domain, and ligand-binding domain to initiate transcription. The 
ecdysone system is further described in U.S. Patent No. 5,514,578 and PCT Pub. Nos. 
WO 97/381 17, WO 96/37609, and WO 93/03162. 
5 Another control means uses a positive tetracycline-controllable transactivator. 

This system involves a mutated tet repressor protein DNA-binding domain (mutated 
tet R-4 amino acid changes which resulted in a reverse tetracycline-regulated 
transactivator protein, i.e., it binds to a tet operator in the presence of tetracycline) 
linked to a polypeptide which activates transcription. Such systems are described in 

10 U.S. Patent Nos. 5,464,758, 5,650,298, and 5,654,168. 

Additional expression control systems and nucleic acid constructs are 
described in U.S. Patent Nos. 5,741,679 and 5,834,186, to Iimovir Laboratories Inc. 

In vivo gene therapy may be accomplished by introducing the gene encoding 
LGR8 polypeptide into cells via local injection of an LGR8 nucleic acid molecule or 

15 by other appropriate viral or non- viral delivery vectors. Hefti 1994, Neurobiology 
25:1418-35. For example, a nucleic acid molecule encoding an LGR8 polypeptide 
may be contained in an adeno-associated virus (AAV) vector for delivery to the 
targeted cells (see, e.g., Johnson, PCT Pub. No. WO 95/34670; PCT App. No. 
PCT/US95/07178). The recombinant AAV genome typically contains AAV inverted 

2 0 terminal repeats flanking a DNA sequence encoding an LGR8 polypeptide operably 
linked to functional promoter and polyadenylation sequences. 

Alternative suitable viral vectors include, but axe not limited to, retrovirus, 
adenovirus, herpes simplex virus, lentivirus, hepatitis virus, parvovirus, papovavirus, 
poxvirus, alphavirus, coronavirus, rhabdovirus, paramyxovirus, and papilloma virus 

2 5 vectors. U.S. Patent No. 5,672,344 describes an in vivo viral-mediated gene transfer 

system involving a recombinant neurotrophic HSV-1 vector. U.S. Patent No. 
5,399,346 provides examples of a process for providing a patient with a therapeutic 
protein by the delivery of human cells that have been treated in vitro to insert a DNA 
segment encoding a therapeutic protein. Additional methods and materials for the 

3 0 practice of gene therapy techniques are described in U.S. Patent Nos. 5,631,236 

(involving adenoviral vectors), 5,672,510 (involving retroviral vectors), 5,635,399 
(involving retroviral vectors expressing cytokines). 

Nonviral delivery methods include, but are not limited to, liposome-mediated 
transfer, naked DNA delivery (direct injection), receptor-mediated transfer (ligand- 
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Gene therapy also can be used to decrease LGR8 polypeptide expression by 
modifying the nucleotide sequence of the endogenous promoter. Such modification is 
typically accomplished via homologous recombination methods. For example, a 
DNA molecule containing all or a portion of the promoter of the LGR8 gene selected 
5 for inactivation can be engineered to remove and/or replace pieces of the promoter 
that regulate transcription. For example, the TATA box and/or the binding site of a 
transcriptional activator of the promoter may be deleted using standard molecular 
biology techniques; such deletion can inhibit promoter activity thereby repressing the 
transcription of the corresponding LGR8 gene. The deletion of the TATA box or the 

10 transcription activator binding site in the promoter may be accomplished by 
generating a DNA construct comprising all or the relevant portion of the LGR8 
polypeptide promoter (from the same or a related species as the LGR8 gene to be 
regulated) in which one or more of the TATA box and/or transcriptional activator 
binding site nucleotides are mutated via substitution, deletion and/or insertion of one 

15 or more nucleotides. As a result, the TATA box and/or activator binding site has 
decreased activity or is rendered completely inactive. This construct, which also will 
typically contain at least about 500 bases of DNA that correspond to the native 
(endogenous) 5' and 3' DNA sequences adjacent to the promoter segment that has 
been modified, may be introduced into the appropriate cells (either ex vivo or in vivo) 

2 0 either directly or via a viral vector as described herein. Typically, the integration of 

the construct into the genomic DNA of the cells will be via homologous 
recombination, where the 5' and 3' DNA sequences in the promoter construct can 
serve to help integrate the modified promoter region via hybridization to the 
endogenous chromosomal DNA. 

25 

Therapeutic Uses 

LGR8 nucleic acid molecules, polypeptides, and agonists and antagonists 
thereof can be used to treat, diagnose, ameliorate, or prevent a number of diseases, 
disorders, or conditions, including those recited herein. 

3 0 LGR8 polypeptide agonists and antagonists include those molecules which 

regulate LGR8 polypeptide activity and either increase or decrease at least one 
activity of the mature form of the LGR8 polypeptide. Agonists or antagonists may be 
co-factors, such as a protein, peptide, carbohydrate, lipid, or small molecular weight 
molecule, which interact with LGR8 polypeptide and thereby regulate its activity. 
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Since LGR8 polypeptide expression has been detected in the uterus, LGR8 
nucleic acid molecules, polypeptides, and agonists and antagonists thereof may be 
useful in diagnosing or treating diseases and conditions affecting skeletal muscle. 
Examples of such diseases and conditions include, but are not limited to, miscarriage, 
5 endometriosis, uterine cancer, and female infertility. Other diseases and conditions 
associated with uterine development and function are encompassed within the scope 
of this invention. 

Since LGR8 polypeptide expression has been detected in the adrenal gland, 
LGRS nucleic acid molecules, polypeptides, and agonists and antagonists thereof may 
10 be useful in diagnosing or treating diseases and conditions affecting the adrenal gland. 
Examples of such diseases and conditions include, but are not limited to, Cushing's 
disease and Addison's disease. Other diseases and conditions associated with the 
development and function of the adrenal gland are encompassed within the scope of 
this invention. 

15 Since LGR8 polypeptide expression has been detected in the testes, LGRS 

nucleic acid molecules, polypeptides, and agonists and antagonists thereof may be 
useful in diagnosing or treating diseases and conditions affecting the testes. Examples 
of such diseases and conditions include, but are not limited to, male infertility and 
testicular carcinoma. Other diseases and conditions associated with the development 

2 0 and function of the testes are encompassed within the scope of this invention. 

Since LGR8 polypeptide expression has been detected in the bone marrow, 
LGR8 nucleic acid molecules, polypeptides, and agonists and antagonists thereof may 
be useful in diagnosing or treating diseases and conditions affecting the bone marrow. 
Examples of such diseases and conditions include, but are not limited to, leukemia. 

2 5 Other diseases and conditions associated with the development and function of the 

bone marrow are encompassed within the scope of this invention. 

Since LGR8 polypeptide expression has been detected in the fetal kidney, 
LGR8 nucleic acid molecules, polypeptides, and agonists and antagonists thereof may 
be useful in diagnosing or treating diseases and conditions affecting the kidney. 

3 0 Examples of such diseases and conditions include, but are not limited to, anemia, 

hypertension, and low blood pressure. Other diseases and conditions associated with 
the development and function of the kidney are encompassed within the scope of this 
invention. 
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Nucleic acid molecules of the invention (including those that do not 
themselves encode biologically active polypeptides) may be used to map the locations 
of the LGR8 gene and related genes on chromosomes. Mapping may be done by 
techniques known in the art, such as PCR amplification and in situ hybridization. 
5 LGR8 nucleic acid molecules (including those that do not themselves encode 

biologically active polypeptides), may be useful as hybridization probes in diagnostic 
assays to test, either qualitatively or quantitatively, for the presence of an LGR8 
nucleic acid molecule in mammalian tissue or bodily fluid samples. 

Other methods may also be employed where it is desirable to inhibit the 

10 activity of one or more LGR8 polypeptides. Such inhibition may be effected by 
nucleic acid molecules that are complementary to and hybridize to expression control 
sequences (triple helix formation) or to LGR8 mRNA. For example, antisense DNA 
or RNA molecules, which have a sequence that is complementary to at least a portion 
of an LGR8 gene can be introduced into the cell. Anti-sense probes may be designed 

15 by available techniques using the sequence of the LGR8 gene disclosed herein. 
Typically, each such antisense molecule will be complementary to the start site (5' 
end) of each selected LGR8 gene. When the antisense molecule then hybridizes to 
the corresponding LGR8 mRNA, translation of this mRNA is prevented or reduced. 
Anti-sense inhibitors provide information relating to the decrease or absence of an 

2 0 LGR8 polypeptide in a cell or organism. 

Alternatively, gene therapy may be employed to create a dominant-negative 
inhibitor of one or more LGR8 polypeptides. In this situation, the DNA encoding a 
mutant polypeptide of each selected LGR8 polypeptide can be prepared and 
introduced into the cells of a patient using either viral or non-viral methods as 

2 5 described herein. Each such mutant is typically designed to compete with 

endogenous polypeptide in its biological role. 

In addition, an LGR8 polypeptide, whether biologically active or not, may be 
used as an immunogen, that is, the polypeptide contains at least one epitope to which 
antibodies may be raised. Selective binding agents that bind to an LGR8 polypeptide 

3 0 (as described herein) may be used for in vivo and in vitro diagnostic purposes, 

including, but not limited to, use in labeled form to detect the presence of LGR8 
polypeptide in a body fluid or cell sample. The antibodies may also be used to 
prevent, treat, or diagnose a number of diseases and disorders, including those recited 
herein. The antibodies may bind to an LGR8 polypeptide so as to diminish or block 
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encoding the mature form (i.e., lacking the signal peptide) of human LGR8. These 
sequences were then used to design gene specific oligonucleotides for the 
identification of cDNA sources and the generation of cDNA clones, using various 
PCR strategies. Several highly homologous, but not identical, LGR8 sequences were 
5 thus isolated. An analysis of these sequences led to the identification of four 
nucleotide sequences encoding the mature forms (i.e., the forms lacking an LGRS 
start codon and the nucleotide sequence encoding the signal peptide) of LGR8-A, 
LGR8-B, LGR8-C, and LGR8-D. 

The nucleotide sequence encoding the mature form of human LGR8-A was 

10 obtained in amplification reactions using 5 jul of a human adrenal Marathon Ready 
cDNA template (Clontech Laboratories; Palo Alto, CA), 1.0 \xm each of the 
amplimers 5'-T-G-C-C-A-A-A-A-A-G-G-A-T-A-T-T-T-T-C-C-C-T-G-T-G-G-G-A- 
A-T-C-T-T-A-3' (SEQ ID NO: 27) and 5'-C-T-A-G-G-A-AA-C-T-G-G-T-T-T-C-A- 
T-T- A-T-A-C-T-G-T-C-T-C-C-A-A-G-T-G-T-T-A-T-T-T-T-G-T-T-C- A- 3 ' (SEQ ID 

15 NO: 28), 200 of dNTPs, 2.5 U of P/wTurbo DNA polymerase (Stratagene; La 
Jolla, CA), and 5 jjL 10X P/wTurbo DNA polymerase reaction buffer in a final 
volume of 50 \iL, Reactions were performed at 94°C for 1 minute for one cycle; 94°C 
for 10 seconds, 60°C for 20 seconds, and 72°C for 5 minutes for 45 cycles; and 72°C 
for 7 minutes for one cycle. 

2 0 The amplification mixture was separated on an agarose gel, the PCR products 

were isolated from the gel, and the products were then blunt-end cloned into pPCR- 
Script Amp SK(+) (Stratagene). A number of clones were sequenced, each containing 
highly homologous (but not identical) LGRS nucleotide sequences. These sequences 
were used to compile the four nucleotide sequences encoding the mature forms (i.e., 
25 the form lacking a start codon and a signal peptide) of LGR8-A, LGR8-B, LGR8-C, 
and LGR8-D. 

To isolate cDNA sequences corresponding to the 5' end of the cDNA 
sequence for the immature form of human LGR8, 5' RACE was performed using 5 
]iL of a human adrenal Marathon Ready cDNA template, 1.0 p.m each of the primers 

3 0 5'-C-C-A-T-C-C-T-A-A-T-A-C-G-A-C-T-C-A-C-T-A-T-A-G-G-G-C-3 9 (SEQ ID 

NO: 29) and 5'-A-T-T-G-T-C-A-T-C-T-A^G-A-A-T-T-A-G-C-C-A-A-G-T-T-A-G- 
C-T-G-A-T-3' (SEQ ID NO: 30), 200 |im of dNTPs, 1 nL of SOX Advantage2 
Polymerase Mix (Clontech Laboratories), and 5 pL 10X Advantage2 PCR buffer in a 
final volume of 50 |j,L. Reactions were performed at 94°C for 1 minute for one cycle; 
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cytoplasmic C-terminal region. Sequence analysis of the full-length LGR8-A coding 
sequence indicated that the cDNA comprises a 2262 bp open reading frame encoding 
a protein of 754 amino acids (Figures 1A-1D). The mature form of LGR8-A is 718 
amino acids in length. LGR8-A is most closely related to glycoprotein hormone 
5 receptor LGR7 (Figures 1 OA- 1 OB). 

The LGR8-B coding sequence is identical to the LGR8-A coding sequence 
with the exception that the LGR8-B coding sequence lacks a portion of the sequence 
encoding the N-terminal extracellular domain. Sequence analysis of the full-length 
LGR8-B coding sequence indicated that the cDNA comprises a 2190 bp open reading 
10 frame encoding a protein of 730 amino acids (Figures 3A-3D). The mature form of 
LGR8-B is 694 amino acids in length. 

The LGR8-C coding sequence is identical to the LGRS-A coding sequence 
with the exception that the LGR8-C coding sequence lacks a portion of sequence 
encoding the N-terminal extracellular domain. Sequence analysis of the full-length 
1 5 LGR8-C coding sequence indicated that the cDNA comprises a 2046 bp open reading 
frame encoding a protein of 682 amino acids (Figures 5A-5D). The mature form of 
LGR8-C is 646 amino acids in length. 

The LGR8-D coding sequence consists of sequence encoding approximately 
90% of the N-terminal extracellular domain of the LGR8-B coding sequence, but 
2 0 lacks sequence encoding the transmembrane domains and the cytoplasmic C-terminal 
region. Sequence analysis of the full-length LGR8-D coding sequence indicated that 
the cDNA comprises a 1098 bp open reading frame encoding a protein of 366 amino 
acids (Figures 7A-7B). The mature form of LGR8-D is 330 amino acids in length. 

To identify cDNA sequences encoding murine LGR8-A, homology-based 

2 5 BLAST searches of a human genomic database were performed using the amino acid 

sequence of human LGR8-A. A number of sequences sharing a high degree of 
homology were found within a 213 kb mouse public genomic sequence (Accession 
No. AC077689). No exons, genes, or homologies to known genes were identified in 
the AC077689 sequence record. The sequences thus identified were currated by hand 

3 0 and electronically compiled to yield the complete nucleotide sequence encoding 

murine LGR8-A (Figures 8A-8D). A sequence comparison of the mature human and 
murine LGR8-A sequences indicates that the sequences share an 86.6% similarity and 
an 83.1% identity (Figures 11A-11B). A sequence comparison of the N-terminal 
extracellular domains (absent the signal peptide) of human and murine LGR8-A 
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an agarose gel, and PCR products of the expected size (319 bp) were identified in 
adult skeletal muscle, testis, and adrenal gland. 

Intron-spanning PCR was next performed on proprietary oligo-dT primed and 
random primed human cDNA libraries for the following tissues: fetal stomach (oligo- 
5 dT primed), fetal stomach (random primed), pons/medulla (oligo-dT primed), breast 
tumor T14S5 (oligo-dT primed), breast tumor T1485 (random primed), ovary tumor 
T22 (oligo-dT primed), ovary tumor T22 (random primed), fetal thymus (oligo-dT 
primed), fetal thymus (random primed), fetal mesentery (oligo-dT primed), fetal 
mesentery (random primed), placenta (oligo-dT primed), placenta (random primed), 

10 Multiple cell lines [A204, A673, Hs729T, fflSM and RD (oligo-dT primed)], 
Multiple cell lines [A204, A673, Hs729T, HISM and RD (random primed)], fetal 
pancreas (oligo-dT primed), fetal pancreas (random primed), lymphoma cell lines 
(oligo-dT primed), lymphoma cell lines (random primed), ovary tumor T23 (oligo-dT 
primed), ovary tumor T23 (random primed), colon tumor T25 (oligo-dT primed), 

15 colon tumor T25 (random primed), adult T-cells (oligo-dT primed), normalized fetal 
tissue (random primed), fetal heart (oligo-dT primed), fetal heart (random primed), 
fetal bladder (oligo-dT primed), fetal bladder (random primed), fetal kidney (oligo-dT 
primed), fetal kidney (random primed), lung tumor T27 (oligo-dT primed), lung 
tumor T27 (random primed), fetal liver (oligo-dT primed), cytoplasmic breast 

2 0 carcinoma cell lines (oligo-dT primed), cytoplasmic breast carcinoma cell lines 
(random primed), fetal spleen (oligo-dT primed), fetal spleen (random primed), uterus 
(oligo-dT primed), uterus (random primed), adrenal (oligo-dT primed), adrenal 
(random primed), forebrain (oligo-dT primed), forebrain (random primed), testis 
(oligo-dT primed), testis (random primed), colon tumor T24 (oligo-dT primed), colon 

2 5 tumor T24 (random primed), fetal heart (oligo-dT primed), fetal scalp (oligo-dT 

primed), fetal scalp (random primed), fetal lung (oligo-dT primed), fetal lung (random 
primed), trachea (oligo-dT primed), trachea (random primed), cerebellum (oligo-dT 
primed), midbrain LNV block 10 (oligo-dT primed), midbrain LNV block 10 
(random primed), prostate tumor T1940 (random primed), fetal ovary (oligo-dT 

3 0 primed), fetal calveria (oligo-dT primed), fetal calveria (random primed), fetal gall 

bladder (oligo-dT primed), fetal gall bladder (random primed), spinal column (oligo- 
dT primed), spinal column (random primed), thalamus (oligo-dT primed), prostate 
tumor Tl 175 (oligo-dT primed), prostate tumor Tl 175 (random primed), breast tumor 
T1543 (oligo-dT primed), breast tumor T1543 (random primed), fetal skin (oligo-dT 
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performed as described above using 300 nm each of the primers S'-G-T-C-G-A-C-G- 
G-C-G-A-G-C-C-C-3' (SEQ ID NO: 40) and 5'-T-C-T-T-T-G-G-G-A-C-C-T-T-G-T- 
C-T-G-C-A-A-3' (SEQ ID NO: 41) and 200 nm of the flurogenic probe 5'-(6-FAM)- 
T-G-G-G-C-C-G-C-G-T-C-T-C-C-T-T-T-G-A-G-C-T-(TAMRA)-3 ' (Primer Express, 
5 PE BioSystems, Foster City, CA; SEQ ED NO: 42; wherein "6-FAM" is the 5' 
reporter dye 6-carboxy-fluorescein and "TAMRA" is the 3' quencher 6-carboxytetra- 
methylrhodamine). The highest levels of LGR8 mRNA expression were detected in 
skeletal muscle and uterus. Lower levels were found in adrenal and testis, with lower 
levels still in thalamus and bone marrow. 

10 

The expression of LGR8 mRNA is localized by in situ hybridization. A panel 
of normal embryonic and adult mouse tissues is fixed in 4% paraformaldehyde, 
embedded in paraffin, and sectioned at 5 jam. Sectioned tissues are permeabilized in 
0.2 M HC1, digested with Proteinase K, and acetylated with triethanolamine and 

15 acetic anhydride. Sections are prehybridized for 1 hour at 60°C in hybridization 
solution (300 mM NaCl, 20 mM Tris-HCl, pH 8.0, 5 mM EDTA, IX Denhardt's 
solution, 0.2% SDS, 10 mM DTT, 0.25 mg/ml tRNA, 25 jig/ml poly A, 25 jig/ml 
polyC and 50% fonnamide) and then hybridized overnight at 60°C in the same 
solution containing 10% dextran and 2 x 10 4 cpm/jil of a 33 P-labeled antisense 

2 0 riboprobe complementary to the human LGR8 gene. The riboprobe is obtained by in 
vitro transcription of a clone containing human LGR8 cDNA sequences using 
standard techniques. 

Following hybridization, sections are rinsed in hybridization solution, treated 
with RNaseA to digest unhybridized probe, and then washed in 0.1X SSC at 55°C for 

2 5 30 minutes. Sections are then immersed in NTB-2 emulsion (Kodak, Rochester, NY), 

exposed for 3 weeks at 4°C, developed, and counterstained with hematoxylin and 
eosin. Tissue morphology and hybridization signal are simultaneously analyzed by 
darkfield and standard illumination for brain (one sagittal and two coronal sections), 
gastrointestinal tract (esophagus, stomach, duodenum, jejunum, ileum, proximal 

3 0 colon, and distal colon), pituitary, liver, lung, heart, spleen, thymus, lymph nodes, 

kidney, adrenal, bladder, pancreas, salivary gland, male and female reproductive 
organs (ovary, oviduct, and uterus in the female; and testis, epididymus, prostate, 
seminal vesicle, and vas deferens in the male), BAT and WAT (subcutaneous, peri- 
renal), bone (femur), skin, breast, and skeletal muscle. 
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B. Expression of LGR8 Polypeptide in Mammalian Cells 

PCR is used to amplify template DNA sequences encoding an LGR8 
polypeptide using primers corresponding to the 5' and 3' ends of the sequence. The 
amplified DNA products may be modified to contain restriction enzyme sites to allow 
5 for insertion into expression vectors. PCR products are gel purified and inserted into 
expression vectors using standard recombinant DNA methodology. An exemplary 
expression vector, pCEP4 (Invitrogen, Carlsbad, CA), that contains an Epstein-Barr 
virus origin of replication, may be used for the expression of LGR8 polypeptides in 
293-EBNA-l cells. Amplified and gel purified PCR products are ligated into pCEP4 
10 vector and introduced into 293-EBNA cells by lipofection. The transfected cells are 
selected in 100 M-g/mL hygromycin and the resulting drug-resistant cultures are grown 
to confluence. The cells are then cultured in serum-free media for 72 hours. The 
conditioned media is removed and LGR8 polypeptide expression is analyzed by SDS- 
PAGE. 

15 LGR8 polypeptide expression may be detected by silver staining. 

Alternatively, LGR8 polypeptide is produced as a fusion protein with an epitope tag, 

such as an IgG constant domain or a FLAG epitope, which may be detected by 

Western blot analysis using antibodies to the peptide tag. 

LGR8 polypeptides may be excised from an SDS-polyacrylamide gel, or 
2 0 LGR8 fusion proteins are purified by affinity chromatography to the epitope tag, and 

subjected to N-terminal amino acid sequence analysis as described herein. 

C. Purification of LGR8 Polypeptide from Mammalian Cells 

LGR8 polypeptide expression constructs are introduced into 293 EBNA or 

2 5 CHO cells using either a lipofection or calcium phosphate protocol. 

To conduct functional studies on the LGR8 polypeptides that are produced, 
large quantities of conditioned media are generated from a pool of hygromycin 
selected 293 EBNA clones. The cells are cultured in 500 cm Nunc Triple Flasks to 
80% confluence before switching to serum-free media a week prior to harvesting the 

3 0 media. Conditioned media is harvested and frozen at 

-20°C until the protein is to be purified. 

Conditioned media is purified by affinity chromatography as described below. 
The media is thawed and then passed through a 0.2 |xm filter. A Protein G column is 
equilibrated with PBS at pH 7.0, and then loaded with the filtered media. The column 
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changes that are informative as to the function of LGR8 polypeptide. Similarly, a 
construct containing the full-length LGR8 polypeptide under the control of the beta 
actin promoter is prepared. The delivery of this construct is expected to result in 
ubiquitous expression. 

5 To generate these constructs, PCR is used to amplify template DNA sequences 

encoding an LGR8 polypeptide using primers that correspond to the 5' and 3' ends of 
the desired sequence and which incorporate restriction enzyme sites to permit 
insertion of the amplified product into an expression vector. Following amplification, 
PCR products are gel purified, digested with the appropriate restriction enzymes, and 

10 ligated into an expression vector using standard recombinant DNA techniques. For 
example, amplified LGR8 polypeptide sequences can be cloned into an expression 
vector under the control of the human (3-actin promoter as described by Graham et aL, 
1997, Nature Genetics, 17:272-74 and Ray et al, 1991, Genes Dev. 5:2265-73. 

Following ligation, reaction mixtures are used to transform an E. coli host 

15 strain by electroporation and transformants are selected for drug resistance. Plasmid 
DNA from selected colonies is isolated and subjected to DNA sequencing to confirm 
the presence of an appropriate insert and absence of mutation. The LGR8 polypeptide 
expression vector is purified through two rounds of CsCl density gradient 
centrifugation, cleaved with a suitable restriction enzyme, and the linearized fragment 

2 0 containing the LGR8 polypeptide transgene is purified by gel electrophoresis. The 
purified fragment is resuspended in 5 raM Tris, pH 7.4, and 0.2 mM EDTA at a 
concentration of 2 mg/mL. 

Single-cell embryos from BDF1 x BDF1 bred mice are injected as described 
(PCT Pub. No. WO 97/23614). Embryos are cultured overnight in a C0 2 incubator 

2 5 and 15-20 two-cell embryos are transferred to the oviducts of a pseudopregnant CD1 

female mice. Offspring obtained from the implantation of microinjected embryos are 
screened by PCR amplification of the integrated transgene in genomic DNA samples 
as follows. Ear pieces are digested in 20 mL ear buffer (20 mM Tris, pH 8.0, 10 mM 
EDTA, 0.5% SDS, and 500 mg/mL proteinase K) at 55°C overnight. The sample is 

3 0 then diluted with 200 mL of TE, and 2 mL of the ear sample is used in a PCR reaction 

using appropriate primers. 

At 8 weeks of age, transgenic founder animals and control animals are 
sacrificed for necropsy and pathological analysis. Portions of spleen are removed and 
total cellular RNA isolated from the spleens using the Total RNA Extraction Kit 
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chromagen (BioTek, Santa Barbara, CA). Sections are counterstained with 
hematoxylin. 

After necropsy, MLN and sections of spleen and thymus from transgenic 
animals and control littermates are removed. Single cell suspensions are prepared by 
5 gently grinding the tissues with the flat end of a syringe against the bottom of a 100 
mm nylon cell strainer (Becton Dickinson, Franklin Lakes, NJ). Cells are washed 
twice, counted, and approximately 1 x 10 6 cells from each tissue are then incubated 
for 10 minutes with 0.5 fig CD16/32(FcyIII/II) Fc block in a 20 [it volume. Samples 
are then stained for 30 minutes at 2-8°C in a 100 p.L volume of PBS (lacking Ca + and 
10 Mg^), 0.1% bovine serum albumin, and 0.01% sodium azide with 0.5 jag antibody of 
FITC or PE-conjugated monoclonal antibodies against CD90.2 (Thy-1.2), CD45R 
(B220), CDllb(Mac-l), Gr-1, CD4, or CD8 (PharMingen, San Diego, CA). 
Following antibody binding, the cells are washed and then analyzed by flow 
cytometry on a FACScan (Becton Dickinson). 

15 

While the present invention has been described in terms of the preferred 
embodiments, it is understood that variations and modifications will occur to those 
skilled in the art. Therefore, it is intended that the appended claims cover all such 
equivalent variations that come within the scope of the invention as claimed. 
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has an activity of the encoded polypeptide as set forth in any of SEQ ID NO: 2, SEQ 
ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED 
NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ ED NO: 18, SEQ ID 
NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23, or is antigenic; 
5 (d) a region of the nucleotide sequence of any of SEQ ID NO: 1, SEQ ID 

NO: 4, SEQ ED NO: 6, SEQ ED NO: 9, SEQ ED NO: 11, SEQ ID NO: 14, SEQ ED 
NO: 16, SEQ ED NO: 19, or SEQ ED NO: 22, or any of (a) - (c) comprising a 
fragment of at least about 16 nucleotides; 

(e) a nucleotide sequence which hybridizes under moderately or highly 
1 0 stringent conditions to the complement of any of (a) - (d); and 

(f) a nucleotide sequence complementary to any of (a) - (d). 

3. An isolated nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of: 

15 (a) a nucleotide sequence encoding a polypeptide as set forth in any of 

SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ 
ED NO: 10, SEQ ID NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ID NO: 23 with at least one 
conservative amino acid substitution, wherein the encoded polypeptide has an activity 

2 0 of the polypeptide set forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, 
SEQ ED NO: 7, SEQ ID NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, 
SEQ ED NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, 
or SEQ ED NO: 23; 

(b) a nucleotide sequence encoding a polypeptide as set forth in any of 

2 5 SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ 

ED NO: 10, SEQ ED NO: 12, SEQ ID NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ 
ED NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23 with at least one 
amino acid insertion, wherein the encoded polypeptide has an activity of the 
polypeptide set forth in any of SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ 

3 0 ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ 

ED NO: 15, SEQ ED NO: 17, SEQ ED NO: 18, SEQ ID NO: 20, SEQ ED NO: 21, or 
SEQ ED NO: 23; 

(c) a nucleotide sequence encoding a polypeptide as set forth in any of 
SEQ ED NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ 



-99- 



WO 02/14489 



PCT/USO 1/25054 



5. A host cell comprising the vector of Claim 4. 

6. The host cell of Claim 5 that is a eukaryotic cell. 

5 

7. The host cell of Claim 5 that is a prokaryotic cell. 

8. A process of producing an LGR8 polypeptide comprising culturing the 
host cell of Claim 5 under suitable conditions to express the polypeptide, and 

1 0 optionally isolating the polypeptide from the culture. 

9. A polypeptide produced by the process of Claim 8. 

10. The process of Claim 8, wherein the nucleic acid molecule comprises 
15 promoter DNA other than the promoter DNA for the native LGR8 polypeptide 

operatively linked to the DNA encoding the LGR8 polypeptide. 

1 1 . The isolated nucleic acid molecule according to Claim 2, wherein the 
percent identity is determined using a computer program selected from the group 

2 0 consisting of GAP, BLASTN, FASTA, BLASTA, BLASTX, BestFit, and the Smith- 
Waterman algorithm. 

12. A process for determining whether a compound inhibits LGR8 
polypeptide activity or LGR8 polypeptide production comprising exposing a cell 

2 5 according to any of Claims 5, 6, or 7 to the compound and measuring LGR8 

polypeptide activity or LGR8 polypeptide production in said cell. 

13. An isolated polypeptide comprising the amino acid sequence as set 
forth in any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ 

3 0 ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ 

ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or SEQ ID NO: 23. 

14. An isolated polypeptide comprising the amino acid sequence selected 
from the group consisting of: 



- 101 - 



WO ((2/14489 



PCT/US01/25054 



12, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 
20, SEQ ID NO: 21, or SEQ ED NO: 23 with at least one conservative amino acid 
substitution, wherein the polypeptide has an activity of the polypeptide set forth in 
any of SEQ ID NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 
5 8, SEQ ED NO: 10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 

17, SEQ ED NO: 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ID NO: 23; 

(b) the amino acid sequence as set forth in any of SEQ ED NO: 2, SEQ ID 
NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 
12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ ED NO: 18, SEQ ED NO: 
10 20, SEQ ED NO: 21, or SEQ ED NO: 23 with at least one amino acid insertion, 
wherein the polypeptide has an activity of the polypeptide set forth in any of SEQ ID 
NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 
10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ED NO: 

18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23; 

15 (c) the amino acid sequence as set forth in any of SEQ ED NO: 2, SEQ ID 

NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ED NO: 
12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 
20, SEQ ED NO: 21, or SEQ ED NO: 23 with at least one amino acid deletion, wherein 
the polypeptide has an activity of the polypeptide set forth in any of SEQ ED NO: 2, 

2 0 SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ 
ID NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ID NO: 18, SEQ 
ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23; 

(d) the amino acid sequence as set forth in any of SEQ ID NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ID NO: 8, SEQ ED NO: 10, SEQ ED NO: 

2 5 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ ED NO: 18, SEQ ED NO: 

20, SEQ ED NO: 21, or SEQ ED NO: 23 which has a C- and/or N- terminal truncation, 
wherein the polypeptide has an activity of the polypeptide set forth in any of SEQ ID 
NO: 2, SEQ ED NO: 3, SEQ ED NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 
10, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ED NO: 

3 0 18, SEQ ED NO: 20, SEQ ED NO: 21, or SEQ ED NO: 23; and 

(e) the arnino acid sequence as set forth in any of SEQ ED NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 10, SEQ ID NO: 
12, SEQ ED NO: 13, SEQ ED NO: 15, SEQ ED NO: 17, SEQ ED NO: 18, SEQ ED NO: 
20, SEQ ED NO: 21, or SEQ ED NO: 23 with at least one modification selected from 
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22. The selective binding agent of Claim 18 that is a human antibody or 
fragment thereof. 

23. The selective binding agent of Claim 18 that is a polyclonal antibody 
5 or fragment thereof. 

24. The selective binding agent Claim 18 that is a monoclonal antibody or 
fragment thereof. 

10 25. The selective binding agent of Claim 18 that is a chimeric antibody or 

fragment thereof. 

26. The selective binding agent of Claim 18 that is a CDR-grafted 
antibody or fragment thereof. 

15 

27. The selective binding agent of Claim 18 that is an antiidiotypic 
antibody or fragment thereof. 

28. The selective binding agent of Claim 18 that is a variable region 
2 0 fragment. 

29. The variable region fragment of Claim 28 that is a Fab or a Fab' 
fragment. 

2 5 30. A selective binding agent or fragment thereof comprising at least one 

complementarity determining region with specificity for a polypeptide having the 
amino acid sequence of any of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
ED NO: 7, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 21, or 

3 0 SEQ ID NO: 23. 

31. The selective binding agent of Claim 18 that is bound to a detectable 

label. 
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41. The polypeptide of Claim 40 that is covalently modified with a water- 
soluble polymer. 

42. The polypeptide of Claim 41, wherein the water-soluble polymer is 
5 selected from the group consisting of polyethylene glycol, monomethoxy- 

polyethylene glycol, dextran, cellulose, poly-(N-vinyl pyrrolidone) polyethylene 
glycol, propylene glycol homopolymers, polypropylene oxide/ethylene oxide co- 
polymers, polyoxyethylated polyols, and polyvinyl alcohol. 

10 43. A composition comprising a nucleic acid molecule of any of Claims 1 , 

2, or 3 and a pharmaceutical^ acceptable formulation agent. 

44. The composition of Claim 43, wherein said nucleic acid molecule is 
contained in a viral vector. 

15 

45. A viral vector comprising a nucleic acid molecule of any of Claims 1, 

2, or 3. 

46. A fusion polypeptide comprising the polypeptide of any of Claims 13, 
2 0 14, or 15 fused to a heterologous amino acid sequence. 

47. The fusion polypeptide of Claim 46, wherein the heterologous amino 
acid sequence is an IgG constant domain or fragment thereof. 

2 5 48. A method for treating, preventing, or ameliorating a medical condition 

comprising administering to a patient the polypeptide of any of Claims 13, 14, or 15, 
or the polypeptide encoded by the nucleic acid of any of Claims 1, 2, or 3. 

49. A method of diagnosing a pathological condition or a susceptibility to 

3 0 a pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide 
of any of Claims 13, 14, or 15, or the polypeptide encoded by the nucleic acid 
molecule of any of Claims 1, 2, or 3 in a sample; and 
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56. A nucleic acid molecule of any of Claims 1, 2, or 3 attached to a solid 
support. 

57. An array of nucleic acid molecules comprising at least one nucleic acid 
5 molecule of any of Claims 1, 2, or 3. 

58. An isolated polypeptide comprising the amino acid sequence as set 
forth in SEQ ED NO: 2 with at least one conservative amino acid substitution selected 
from the group consisting of: isoleucine at position 26; valine at position 41; 

10 isoleucine at position 55; aspartic acid at position 78; aspartic acid at position 123; 
arginine at position 130; valine at position 135; methionine at position 142; leucine at 
position 166; tyrosine at position 167; lysine at position 201; valine at position 204; 
isoleucine at position 216; glutamatic acid at position 217; leucine at position 221; 
leucine at position 240; leucine at position 252; isoleucine at position 277; methionine 

15 at position 288; lysine at position 290; isoleucine at position 324; isoleucine at 
position 341; isoleucine at position 344; aspartic acid at position 350; leucine at 
position 376; valine at position 420; valine at position 425; valine at position 427; 
isoleucine at position 434; tyrosine at position 442; arginine at position 444; tyrosine 
at position 450; isoleucine at position 466; isoleucine at position 471; leucine at 

2 0 position 476; phenylalanine at position 478; glutamatic acid at position 481; histidine 
at position 485; phenylalanine at position 515; tyrosine at position 521; isoleucine at 
position 522; tyrosine at position 526; valine at position 531; valine at position 541; 
isoleucine at position 551; valine at position 552; glutamatic acid at position 561; 
phenylalanine at position 562; tyrosine at position 566; tyrosine at position 577; 

2 5 aspartic acid at position 579; isoleucine at position 597; isoleucine at position 603; 

valine at position 616; isoleucine at position 621; isoleucine at position 626; lysine at 
position 632; leucine at position 649; isoleucine at position 654; valine at position 
675; isoleucine at position 682; glutamatic acid at position 700; isoleucine at position 
702; tyrosine at position 707; tyrosine at position 709; isoleucine at position 727; 

3 0 valine at position 729; methionine at position 737; methionine at position 745; and 

leucine at position 749; wherein the polypeptide has an activity of the polypeptide as 
set forth in SEQ ID NO: 2. 
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FIG. 1A 



atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 43 

Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu lie 

15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 

Thr Met Phe Phe Leu Leu His Phe He Val Leu He Asn Val Lys Asp 

20 25 30 

ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 144 

Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 

35 40 45 

tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 192 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 240 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 

Gly Asp Thr Ser Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn 

85 90 95 

get aac age gtg gee tta aca cag gag tgc ttt eta aaa cag tat cca 336 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 

100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 

115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 

Leu Lys Ser Val Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu 

130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 

145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His He 

165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 57 6 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu 

180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly He Phe Lys Asp Leu 

195 200 205 
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FIG. IB 



cat cag eta act tgg eta att eta gat gac aat cca ata ace aga att 672 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro He Thr Arg lie 
210 215 220 

tea cag cgc ttg ttt acg gga tta aat tec ttg ttt ttc ctg tct atg 720 

Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 

225 230 235 240 

gtt aat aac tac tta gaa get ctt ccc aag cag atg tgt gec caa atg 768 

Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 
245 250 255 

cct caa etc aac tgg gtg gat ttg gaa ggc aat aga ata aag tat etc 816 

Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg He Lys Tyr Leu 
260 265 270 

aca aat tct acg ttt ctg teg tgc gat teg etc aca gtg ctg ttt ctg 8 64 

Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu 
275 280 285 

cct aga aat caa att ggt ttt gtt cca gag aag aca ttt tct tea tta 912 

Pro Arg Asn Gin lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu 
290 295 300 

aaa aat tta gga gaa ctg gat ctg tct age aat acg ata acg gag eta 960 

Lys Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Thr He Thr Glu Leu 

305 310 315 320 

tea cct cac ctt ttt aaa gac ttg aag ctt eta caa aag ctg aac ctg 1008 

Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu 
325 330 335 

tea tec aat cct ctt atg tat ctt cac aag aac cag ttt gaa agt ctt 1056 

Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu 
340 345 350 

aaa caa ctt cag tct eta gac ctg gaa agg ata gag att cca aat ata 1104 

Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He 
355 360 365 

aac aca cga atg ttt caa ccc atg aag aat ctt tct cac att tat ttc 1152 

Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Phe 
370 375 380 

aaa aac ttt cga tac tgc tec tat get ccc cat gtc cga ata tgt atg 1200 

Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val Arg He Cys Met 

385 390 395 400 

ccc ttg acg gac ggc att tct tea ttt gag gac etc ttg get aac aat 1248 

Pro Leu Thr Asp Gly He Ser Ser Phe Glu Asp Leu Leu Ala Asn Asn 
405 410 415 
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FIG. 1C 



ate etc aga ata ttt gtc tgg gtt ata get ttc att acc tgc ttt gga 1296 

lie Leu Arg lie Phe Val Trp Val lie Ala Phe lie Thr Cys Phe Gly 

420 425 430 

aat ctt ttt gtc att ggc atg aga tct ttc att aaa get gaa aat aca 1344 

Asn Leu Phe Val lie Gly Met Arg Ser Phe lie Lys Ala Glu Asn Thr 
435 440 445 

act cac get atg tec ate aaa ate ctt tgt tgt get gat tgc ctg atg 1392 

Thr His Ala Met Ser lie Lys lie Leu Cys Cys Ala Asp Cys Leu Met 
450 455 460 

ggt gtt tac ttg ttc ttt gtt ggc att ttc gat ata aaa tac cga ggg 1440 

Gly Val Tyr Leu Phe Phe Val Gly lie Phe Asp lie Lys Tyr Arg Gly 

465 470 475 480 

cag tat cag aag tat gec ttg ctg tgg atg gag age gtg cag tgc cgc 

Gin Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Gin Cys Arg 
485 490 495 

etc atg ggg ttc ctg gee atg ctg tec acc gaa gtc tct gtt ctg eta 

Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val Ser Val Leu Leu 

500 505 510 

ctg acc tac ttg act ttg gag aag ttc ctg gtc att gtc ttc ccc ttc 

Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val lie Val Phe Pro Phe 
515 520 525 

agt aac att cga cct gga aaa egg cag acc tea gtc ate etc att tgc 1632 

Ser Asn lie Arg Pro Gly Lys Arg Gin Thr Ser Val lie Leu lie Cys 
530 535 540 



ate tgg atg gcg gga ttt tta ata get gta att cca ttt tgg aat aag 
lie Trp Met Ala Gly Phe Leu lie Ala Val lie Pro Phe Trp Asn Lys 
545 550 555 560 



att ttc eta ggt gtg aac ttg ctg get ttt etc ate att gtg ttt tec 
lie Phe Leu Gly Val Asn Leu Leu Ala Phe Leu lie lie Val Phe Ser 
595 600 605 



1488 



1536 



1584 



1680 



gat tat ttt gga aac ttt tat ggg aaa aat gga gta tgt ttc cca ctt 1728 

Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu 

565 570 575 

tat tat gac caa aca gaa gat att gga age aaa ggg tat tct ctt gga 1776 

Tyr Tyr Asp Gin Thr Glu Asp lie Gly Ser Lys Gly Tyr Ser Leu Gly 

580 585 590 



1824 



tat att act atg ttc tgt tec att caa aaa acc gec ttg cag acc aca 1872 
Tyr He Thr Met Phe Cys Ser He Gin Lys Thr Ala Leu Gin Thr Thr 
610 615 620 
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FIG. ID 



gaa gta agg aat tgt ttt gga aga gag gtg get gtt gca aat cgt ttc 1920 

Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg Phe 

625 630 635 640 

ttt ttt ata gtg ttc tct gat gec ate tgc tgg att cct gta ttt gta 1968 

Phe Phe lie Val Phe Ser Asp Ala lie Cys Trp lie Pro Val Phe Val 
645 650 655 

gtt aaa ate ctt tec etc ttc egg gtg gaa ata cca gac aca atg act 2016 

Val Lys lie Leu Ser Leu Phe Arg Val Glu lie Pro Asp Thr Met Thr 

660 665 670 

tec tgg ata gtg att ttt ttc ctt cca gtt aac agt get ttg aat cca 2064 

Ser Trp lie Val lie Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro 
675 680 685 

ate etc tat act etc aca acc aac ttt ttt aag gac aag ttg aaa cag 2112 

lie Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp Lys Leu Lys Gin 

690 695 700 

ctg ctg cac aaa cat cag agg aaa tea att ttc aaa att aaa aaa aaa 2160 

Leu Leu His Lys His Gin Arg Lys Ser lie Phe Lys lie Lys Lys Lys 

705 710 715 720 

agt tta tct aca tec att gtg tgg ata gag gac tec tct tec ctg aaa 2208 

Ser Leu Ser Thr Ser lie Val Trp lie Glu Asp Ser Ser Ser Leu Lys 
725 730 735 

ctt ggg gtt ttg aac aaa ata aca ctt gga gac agt ata atg aaa cca 2256 

Leu Gly Val Leu Asn Lys lie Thr Leu Gly Asp Ser lie Met Lys Pro 
740 745 750 



gtt tec tag 2265 
Val Ser 

755 
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FIG. 2A 



caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 4 8 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 

15 10 15 

ggg aat ctt ace aag tgc tta ccc cga get ttt cac tgt gat ggc aag 96 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 

20 25 30 

gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 14 4 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 

35 40 45 

gga tgg gcg acc ata ttt ggc aca gtg cat gga aat get aac age gtg 192 

Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 

50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 288 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 

85 90 95 

ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 336 

Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 

100 105 110 

ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 384 

lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 

115 120 125 

aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 4 32 

Lys lie Phe Leu Gin His Asn Cys lie Arg His lie Ser Arg Lys Ala 

130 135 140 

ttt ttt gga tta tgt aat ctg caa ata tta tat etc aac cac aac tgc 480 

Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

ate aca acc etc aga cct gga ata ttc aaa gac tta cat cag eta act 528 

lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 

165 170 175 

tgg eta att eta gat gac aat cca ata acc aga att tea cag cgc ttg 576 

Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Arg Leu 

180 185 190 

ttt acg gga tta aat tec ttg ttt ttc ctg tct atg gtt aat aac tac 624 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 

195 200 205 
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FIG. 2B 



tta gaa get ctt ccc aag cag atg tgt gec caa atg cct caa etc aac 672 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 

210 215 220 

tgg gtg gat ttg gaa ggc aat aga ata aag tat etc aca aat tct acg 720 

Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 

225 230 235 240 

ttt ctg teg tgc gat teg etc aca gtg ctg ttt ctg cct aga aat caa 768 

Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 

245 250 255 

att ggt ttt gtt cca gag aag aca ttt tct tea tta aaa aat tta gga 816 

lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 

260 265 270 

gaa ctg gat ctg tct age aat acg ata acg gag eta tea cct cac ctt 864 

Glu Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu 

275 280 285 

ttt aaa gac ttg aag ctt eta caa aag ctg aac ctg tea tec aat cct 912 

Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 

290 295 300 

ctt atg tat ctt cac aag aac cag ttt gaa agt ctt aaa caa ctt cag 960 

Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu Lys Gin Leu Gin 

305 310 315 320 

tct eta gac ctg gaa agg ata gag att cca aat ata aac aca cga atg 1008 

Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He Asn Thr Arg Met 

325 330 335 

ttt caa ccc atg aag aat ctt tct cac att tat ttc aaa aac ttt cga 1056 

Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Phe Lys Asn Phe Arg 

340 345 350 

tac tgc tec tat get ccc cat gtc cga ata tgt atg ccc ttg acg gac 1104 

Tyr Cys Ser Tyr Ala Pro His Val Arg He Cys Met Pro Leu Thr Asp 

355 360 365 

ggc att tct tea ttt gag gac etc ttg get aac aat ate etc aga 114 9 

Gly He Ser Ser Phe Glu Asp Leu Leu Ala Asn Asn He Leu Arg 

370 375 380 
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FIG. 3A 



atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 48 
Met lie Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu lie 
15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 
Thr Met Phe Phe Leu Leu His Phe lie Val Leu lie Asn Val Lys Asp 
20 25 30 



ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 
Phe Ala Leu Thr Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly 
35 40 45 



tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 
Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 



144 



tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 192 
Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 
50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 24 0 
Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 
Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 
85 90 95 

get aac age gtg gee tta aca cag gag tgc ttt eta aaa cag tat cca 336 
Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 
Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 



432 



aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 
Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 
145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 
Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 57 6 
Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 
180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 
Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 
195 200 205 
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FIG. 3B 



cat cag eta act tgg eta att eta gat gac aat cca ata acc aga att 672 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie 

210 215 220 

tea cag cgc ttg ttt acg gga tta aat tec ttg ttt ttc ctg tct atg 720 

Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 

225 230 235 240 

gtt aat aac tac tta gaa get ctt ccc aag cag atg tgt gec caa atg 7 68 

Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 

245 250 255 

cct caa etc aac tgg gtg gat ttg gaa ggc aat aga ata aag tat etc 816 

Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu 

260 265 270 

aca aat tct acg ttt ctg teg tgc gat teg etc aca gtg ctg gat ctg 864 

Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu 

275 280 285 

tct age aat acg ata acg gag eta tea cct cac ctt ttt aaa gac ttg 912 

Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu Phe Lys Asp Leu 

290 295 300 

aag ctt eta caa aag ctg aac ctg tea tec aat cct ctt atg tat ctt 960 

Lys Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro Leu Met Tyr Leu 

305 310 315 320 

cac aag aac cag ttt gaa agt ctt aaa caa ctt cag tct eta gac ctg 1008 

His Lys Asn Gin Phe Glu Ser Leu Lys Gin Leu Gin Ser Leu Asp Leu 

325 330 335 

gaa agg ata gag att cca aat ata aac aca cga atg ttt caa ccc atg 1056 

Glu Arg lie Glu lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met 

340 345 350 

aag aat ctt tct cac att tat ttc aaa aac ttt cga tac tgc tec tat 1104 

Lys Asn Leu Ser His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr 

355 360 365 

get ccc cat gtc cga ata tgt atg ccc ttg acg gac ggc att tct tea 1152 

Ala Pro His Val Arg lie Cys Met Pro Leu Thr Asp Gly lie Ser Ser 

370 375 380 

ttt gag gac etc ttg get aac aat ate etc aga ata ttt gtc tgg gtt 1200 

Phe Glu Asp Leu Leu Ala Asn Asn lie Leu Arg lie Phe Val Trp Val 

385 390 395 400 

ata get ttc att acc tgc ttt gga aat ctt ttt gtc att ggc atg aga 1248 

lie Ala Phe lie Thr Cys Phe Gly Asn Leu Phe Val He Gly Met Arg 

405 410 415 
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FIG. 3C 



tct ttc att aaa get gaa aat aca act cac get atg tec ate aaa ate 
Ser Phe lie Lys Ala Glu Asn Thr Thr His Ala Met Ser He Lys He 
420 425 430 

ctt tgt tgt get gat tgc ctg atg ggt gtt tac ttg ttc ttt gtt ggc 
Leu Cys Cys Ala Asp Cys Leu Met Gly Val Tyr Leu Phe Phe Val Gly 
435 440 445 

att ttc gat ata aaa tac cga ggg cag tat cag aag tat gee ttg ctg 
He Phe Asp lie Lys Tyr Arg Gly Gin Tyr Gin Lys Tyr Ala Leu Leu 
450 455 460 

tgg atg gag age gtg cag tgc cgc etc atg ggg ttc ctg gee atg ctg 
Trp Met Glu Ser Val Gin Cys Arg Leu Met Gly Phe Leu Ala Met Leu 
465 470 475 480 

tec ace gaa gtc tct gtt ctg eta ctg acc tac ttg act ttg gag aag 
Ser Thr Glu Val Ser Val Leu Leu Leu Thr Tyr Leu Thr Leu Glu Lys 
485 490 495 

ttc ctg gtc att gtc ttc ccc ttc agt aac att cga cct gga aaa egg 
Phe Leu Val He Val Phe Pro Phe Ser Asn He Arg Pro Gly Lys Arg 
500 505 510 

cag acc tea gtc ate etc att tgc ate tgg atg gcg gga ttt tta ata 
Gin Thr Ser Val lie Leu Tie Cys He Trp Met Ala Gly Phe Leu He 
515 520 525 

get gta att cca ttt tgg aat aag gat tat ttt gga aac ttt tat ggg 
Ala Val He Pro Phe Trp Asn Lys Asp Tyr Phe Gly Asn Phe Tyr Gly 
530 535 540 

aaa aat gga gta tgt ttc cca ctt tat tat gac caa aca gaa gat att 
Lys Asn Gly Val Cys Phe Pro Leu Tyr Tyr Asp Gin Thr Glu Asp He 
545 550 555 560 

gga age aaa ggg tat tct ctt gga att ttc eta ggt gtg aac ttg ctg 
Gly Ser Lys Gly Tyr Ser Leu Gly He Phe Leu Gly Val Asn Leu Leu 
565 570 575 

get ttt etc ate att gtg ttt tec tat att act atg ttc tgt tec att 
Ala Phe Leu He He Val Phe Ser Tyr He Thr Met Phe Cys Ser He 
580 585 590 

caa aaa acc gee ttg cag acc aca gaa gta agg aat tgt ttt gga aga 
Gin Lys Thr Ala Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg 
595 600 605 

gag gtg get gtt gca aat cgt ttc ttt ttt ata gtg ttc tct gat gee 
Glu Val Ala Val Ala Asn Arg Phe Phe Phe He Val Phe Ser Asp Ala 
610 615 620 
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1344 
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1440 



1488 
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1680 
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FIG. 3D 



ate tgc tgg att cct gta ttt gta gtt aaa ate ctt tec etc ttc egg 
lie Cys Trp lie Pro Val Phe Val Val Lys lie Leu Ser Leu Phe Arg 
625 630 635 640 



1920 



gtg gaa ata cca gac aca atg act tec tgg ata gtg att ttt ttc ctt 
Val Glu He Pro Asp Thr Met Thr Ser Trp He Val He Phe Phe Leu 
645 650 655 



,968 



cca gtt aac agt get ttg aat cca ate etc tat act etc aca ace aac 
Pro Val Asn Ser Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn 
660 665 670 



2016 



ttt ttt aag gac aag ttg aaa cag ctg ctg cac aaa cat cag agg aaa 
Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys 
675 680 685 



2064 



tea att ttc aaa att aaa aaa aaa agt tta tct aca tec att gtg tgg 
Ser He Phe Lys He Lys Lys Lys Ser Leu Ser Thr Ser He Val Trp 
690 695 700 



1112 



ata gag gac tec tct tec ctg aaa ctt ggg gtt ttg aac aaa ata aca 
He Glu Asp Ser Ser Ser Leu Lys Leu Gly Val Leu Asn Lys lie Thr 
705 710 715 720 



2160 



ctt gga gac agt ata atg aaa cca gtt tec tag 
Leu Gly Asp Ser lie Met Lys Pro Val Ser 
725 730 



1193 
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FIG. 4A 



caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 48 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 

15 10 15 

ggg aat ctt ace aag tgc tta ccc cga get ttt cac tgt gat ggc aag 96 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 

20 25 30 

gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 14 4 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 

35 40 45 

gga tgg gcg ace at a ttt ggc aca gtg cat gga aat get aac age gtg 1 92 

Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 

50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 

65 70 75 80 

tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 288 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 

85 90 95 

ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 336 

Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 

100 105 110 

ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 384 

He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 

115 120 125 

aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 432 

Lys He Phe Leu Gin His Asn Cys lie Arg His He Ser Arg Lys Ala 

130 135 140 

ttt ttt gga tta tgt aat ctg caa ata tta tat etc aac cac aac tgc 480 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu Asn His Asn Cys 

145 150 155 160 

ate aca acc etc aga cct gga ata ttc aaa gac tta cat cag eta act 528 

He Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 

165 170 175 

tgg eta att eta gat gac aat cca ata acc aga att tea cag cgc ttg 576 

Trp Leu He Leu Asp Asp Asn Pro lie Thr Arg He Ser Gin Arg Leu 

180 185 190 

ttt acg gga tta aat tec ttg ttt ttc ctg tct atg gtt aat aac tac 624 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 

195 200 205 
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FIG. 4B 



tta gaa get ctt ccc aag cag atg tgt gec caa atg cct caa etc aac 672 
Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

tgg gtg gat ttg gaa ggc aat aga ata aag tat etc aca aat tct acg 720 
Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 
225 230 235 240 

ttt ctg teg tgc gat teg etc aca gtg ctg gat ctg tct age aat acg 7 68 
Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu Ser Ser Asn Thr 
245 250 255 

ata acg gag eta tea cct cac ctt ttt aaa gac ttg aag ctt eta caa 816 
lie Thr Glu Leu Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin 
260 265 270 

aag ctg aac ctg tea tec aat cct ctt atg tat ctt cac aag aac cag 864 
Lys Leu Asn Leu Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin 
275 280 285 

ttt gaa agt ctt aaa caa ctt cag tct eta gac ctg gaa agg ata gag 912 
Phe Glu Ser Leu Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg lie Glu 
290 295 300 

att cca aat ata aac aca cga atg ttt caa ccc atg aag aat ctt tct 960 
lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 
305 310 315 320 

cac att tat ttc aaa aac ttt cga tac tgc tec tat get ccc cat gtc 1008 
His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val 
325 330 335 

cga ata tgt atg ccc ttg acg gac ggc att tct tea ttt gag gac etc 1056 
Arg lie Cys Met Pro Leu Thr Asp Gly lie Ser Ser Phe Glu Asp Leu 
340 345 350 

ttg get aac aat ate etc aga 1077 
Leu Ala Asn Asn lie Leu Arg 
355 
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FIG. 5A 



atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 4 8 

Met lie Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu lie 

15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 

Thr Met Phe Phe Leu Leu His Phe lie Val Leu lie Asn Val Lys Asp 

20 25 30 

ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 14 4 

Phe Ala Leu Thr Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly 

35 40 45 

tat ttt ccc tgt ggg aat ctt ace aag tgc tta ccc cga get ttt cac 192 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 24 0 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 

Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 

85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 

100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 

115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 

Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 

130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 4 80 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 

145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 

165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta att eta 576 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu lie Leu 

180 185 190 

gat gac aat cca ata acc aga att tea cag cgc ttg ttt acg gga tta 624 

Asp Asp Asn Pro lie Thr Arg lie Ser Gin Arg Leu Phe Thr Gly Leu 

195 200 205 
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FIG. 5B 



aat tec ttg ttt ttc ctg tct atg gtt aat aac tac tta gaa get ctt 672 
Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr Leu Glu Ala Leu 
210 215 220 

ccc aag cag atg tgt gec caa atg cct caa etc aac tgg gtg gat ttg 720 
Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn Trp Val Asp Leu 
225 230 235 240 

gaa ggc aat aga ata aag tat etc aca aat tct acg ttt ctg teg tgc 768 
Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr Phe Leu Ser Cys 
245 250 255 

gat teg etc aca gtg ctg gat ctg tct age aat acg ata acg gag eta 816 
Asp Ser Leu Thr Val Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu 
260 265 270 

tea cct cac ctt ttt aaa gac ttg aag ctt eta caa aag eta gac ctg 864 
Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asp Leu 
275 280 285 

gaa agg ata gag att cca aat ata aac aca cga atg ttt caa ccc atg 912 
Glu Arg lie Glu lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met 
290 295 300 

aag aat ctt tct cac att tat ttc aaa aac ttt cga tac tgc tec tat 960 
Lys Asn Leu Ser His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr 
305 310 315 320 

get ccc cat gtc cga ata tgt atg ccc ttg acg gac ggc att tct tea 1008 
Ala Pro His Val Arg lie Cys Met Pro Leu Thr Asp Gly lie Ser Ser 
325 330 335 

ttt gag gac etc ttg get aac aat ate etc aga ata ttt gtc tgg gtt 1056 
Phe Glu Asp Leu Leu Ala Asn Asn lie Leu Arg lie Phe Val Trp Val 
340 345 350 

ata get ttc att acc tgc ttt gga aat ctt ttt gtc att ggc atg aga 1104 
lie Ala Phe lie Thr Cys Phe Gly Asn Leu Phe Val lie Gly Met Arg 
355 360 365 

tct ttc att aaa get gaa aat aca act cac get atg tec ate aaa ate 1152 
Ser Phe lie Lys Ala Glu Asn Thr Thr His Ala Met Ser lie Lys lie 
370 375 380 

ctt tgt tgt get gat tgc ctg atg ggt gtt tac ttg ttc ttt gtt ggc 1200 
Leu Cys Cys Ala Asp Cys Leu Met Gly Val Tyr Leu Phe Phe Val Gly 
385 390 395 400 

att ttc gat ata aaa tac cga ggg cag tat cag aag tat gec ttg ctg 1248 
lie Phe Asp lie Lys Tyr Arg Gly Gin Tyr Gin Lys Tyr Ala Leu Leu 
405 410 415 
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FIG. 5C 



tgg atg gag age gtg cag tgc cgc etc atg ggg ttc ctg gec atg ctg 1296 

Trp Met Glu Ser Val Gin Cys Arg Leu Met Gly Phe Leu Ala Met Leu 
420 425 430 

tec acc gaa gtc tct gtt ctg eta ctg ace tac ttg act ttg gag aag 1344 

Ser Thr Glu Val Ser Val Leu Leu Leu Thr Tyr Leu Thr Leu Glu Lys 
435 440 445 

ttc ctg gtc att gtc ttc ccc ttc agt aac att cga cct gga aaa egg 1392 

Phe Leu Val lie Val Phe Pro Phe Ser Asn lie Arg Pro Gly Lys Arg 

450 455 460 

cag acc tea gtc ate etc att tgc ate tgg atg gcg gga ttt tta ata 1440 

Gin Thr Ser Val lie Leu lie Cys lie Trp Met Ala Gly Phe Leu He 

465 470 475 480 

get gta att cca ttt tgg aat aag gat tat ttt gga aac ttt tat ggg 1488 

Ala Val He Pro Phe Trp Asn Lys Asp Tyr Phe Gly Asn Phe Tyr Gly 
485 490 495 

aaa aat gga gta tgt ttc cca ctt tat tat gac caa aca gaa gat att 1536 

Lys Asn Gly Val Cys Phe Pro Leu Tyr Tyr Asp Gin Thr Glu Asp He 
500 505 510 

gga age aaa ggg tat tct ctt gga att ttc eta ggt gtg aac ttg ctg 1584 

Gly Ser Lys Gly Tyr Ser Leu Gly He Phe Leu Gly Val Asn Leu Leu 

515 520 525 

get ttt etc ate att gtg ttt tec tat att act atg ttc tgt tec att 1632 

Ala Phe Leu He He Val Phe Ser Tyr He Thr Met Phe Cys Ser He 

530 535 540 

caa aaa acc gec ttg cag acc aca gaa gta agg aat tgt ttt gga aga 1680 

Gin Lys Thr Ala Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg 

545 550 555 560 

gag gtg get gtt gca aat cgt ttc ttt ttt ata gtg ttc tct gat gec 1728 

Glu Val Ala Val Ala Asn Arg Phe Phe Phe lie Val Phe Ser Asp Ala 
565 570 575 

ate tgc tgg att cct gta ttt gta gtt aaa ate ctt tec etc ttc egg 1776 

He Cys Trp lie Pro Val Phe Val Val Lys He Leu Ser Leu Phe Arg 
580 585 590 

gtg gaa ata cca gac aca atg act tec tgg ata gtg att ttt ttc ctt 1824 

Val Glu lie Pro Asp Thr Met Thr Ser Trp lie Val He Phe Phe Leu 
595 600 605 



cca gtt aac agt get ttg aat cca ate etc tat act etc aca acc aac 
Pro Val Asn Ser Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn 
610 615 620 



1872 



WO 02/14489 



PCT/USOl/25054 



FIG. 5D 



ttt ttt aag gac aag ttg aaa cag ctg ctg cac aaa cat cag agg aaa 1920 

Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys 

625 630 635 640 

tea att ttc aaa att aaa aaa aaa agt tta tct aca tec att gtg tgg 1968 

Ser lie Phe Lys lie Lys Lys Lys Ser Leu Ser Thr Ser lie Val Trp 

645 650 655 

ata gag gac tec tct tec ctg aaa ctt ggg gtt ttg aac aaa ata aca 2016 

lie Glu Asp Ser Ser Ser Leu Lys Leu Gly Val Leu Asn Lys lie Thr 

660 665 670 



ctt gga gac agt ata atg aaa cca gtt tec tag 
Leu Gly Asp Ser lie Met Lys Pro Val Ser 
675 680 



2049 
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FIG. 6A 



caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 48 
Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

ggg aat ctt ace aag tgc tta ccc cga get ttt cac tgt gat ggc aag 96 
Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 14 4 
Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

gga tgg gcg acc at a ttt ggc aca gtg cat gga aat get aac age gtg 192 
Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 
Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 288 
Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 



ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 
Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 HO 



aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 
Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 



336 



ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 384 
He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 
115 120 125 



431 



ttt ttt gga tta tgt aat ctg caa ata tta att eta gat gac aat cca 480 

Phe Phe Gly Leu Cys Asn Leu Gin lie Leu lie Leu Asp Asp Asn Pro 

145 150 155 160 

ata acc aga att tea cag cgc ttg ttt acg gga tta aat tec ttg ttt 528 

He Thr Arg He Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe 
165 170 175 

ttc ctg tct atg gtt aat aac tac tta gaa get ctt ccc aag cag atg 576 

Phe Leu Ser Met Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met 
180 185 190 

tgt gee caa atg cct caa etc aac tgg gtg gat ttg gaa ggc aat aga 624 

Cys Ala Gin Met Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg 
195 200 205 
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FIG. 6B 



ata aag tat etc aca aat tct acg ttt ctg teg tgc gat teg etc aca 672 

lie Lys Tyr Leu Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr 

210 215 220 

gtg ctg gat ctg tct age aat acg ata acg gag eta tea cct cac ctt 720 

Val Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu 

225 230 235 240 

ttt aaa gac ttg aag ctt eta caa aag eta gac ctg gaa agg ata gag 768 

Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asp Leu Glu Arg lie Glu 

245 250 255 

att cca aat ata aac aca cga atg ttt caa ccc atg aag aat ctt tct 816 

lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 

260 265 270 

cac att tat ttc aaa aac ttt cga tac tgc tec tat get ccc cat gtc 864 

His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val 

275 280 285 

cga ata tgt atg ccc ttg acg gac ggc att tct tea ttt gag gac etc 912 

Arg lie Cys Met Pro Leu Thr Asp Gly lie Ser Ser Phe Glu Asp Leu 

290 295 300 



ttg get aac aat ate etc aga 
Leu Ala Asn Asn lie Leu Arg 
305 310 



933 
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FIG. 7 A 



atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 48 

Met lie Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu lie 

15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 

Thr Met Phe Phe Leu Leu His Phe lie Val Leu lie Asn Val Lys Asp 

20 25 30 

ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 144 

Phe Ala Leu Thr Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly 

35 40 45 

tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 192 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 24 0 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 

Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 
85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 

100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 

115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 

Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 

130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 4 80 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 

145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 576 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 

180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 

195 200 205 
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FIG. 7B 



cat cag eta 
His Gin Leu 
210 

tea cag cgc 
Ser Gin Arg 
225 

gtt aat aac 
Val Asn Asn 



cct caa etc 
Pro Gin Leu 



aca aat tct 
Thr Asn Ser 
275 

tct age aat 
Ser Ser Asn 
290 

aag ctt eta 
Lys Leu Leu 

305 

cac aag aac 
His Lys Asn 



gaa agg ata 
Glu Arg lie 



aag aat ctt 
Lys Asn Leu 
355 



act tgg eta 
Thr Trp Leu 



ttg ttt acg 
Leu Phe Thr 
230 

tac tta gaa 
Tyr Leu Glu 
245 

aac tgg gtg 
Asn Trp Val 
260 

acg ttt ctg 
Thr Phe Leu 



acg ata acg 
Thr lie Thr 



caa aag ctg 
Gin Lys Leu 
310 

cag ttt gaa 
Gin Phe Glu 
325 

gag att cca 
Glu lie Pro 
340 

tct cac ata 
Ser His lie 



att eta gat 
Tie Leu Asp 
215 

gga tta aat 
Gly Leu Asn 



get ctt ccc 
Ala Leu Pro 



gat ttg gaa 
Asp Leu Glu 
265 

teg tgc gat 
Ser Cys Asp 
280 

gag eta tea 
Glu Leu Ser 
295 

aac ctg tea 
Asn Leu Ser 



agt ctt aaa 
Ser Leu Lys 



aat ata aac 
Asn lie Asn 
345 

gtt caa tat 
Val Gin Tyr 
360 



gac aat cca 
Asp Asn Pro 
220 

tec ttg ttt 
Ser Leu Phe 
235 

aag cag atg 
Lys Gin Met 

250 

ggc aat aga 
Gly Asn Arg 



teg etc aca 
Ser Leu Thr 



cct cac ctt 
Pro His Leu 
300 

tec aat cct 
Ser Asn Pro 
315 

caa ctt cag 
Gin Leu Gin 
330 

aca cga atg 
Thr Arg Met 



tat gat gtg 
Tyr Asp Val 



ata ace aga 
lie Thr Arg 



ttc ctg tct 
Phe Leu Ser 



tgt gec caa 
Cys Ala Gin 
255 

ata aag tat 
lie Lys Tyr 
270 

gtg ctg gat 
Val Leu Asp 

285 

ttt aaa gac 
Phe Lys Asp 



ctt atg tat 
Leu Met Tyr 



tct eta gac 
Ser Leu Asp 
335 

ttt caa ccc 
Phe Gin Pro 
350 

ccg aca tga 
Pro Thr 
365 



att 672 
He 



atg 720 

Met 

240 

atg 768 
Met 



etc 816 
Leu 



ctg 864 
Leu 



ttg 912 
Leu 



ctt 960 

Leu 

320 

ctg 1008 
Leu 



atg 1056 
Met 



1101 
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FIG. 8A 



atg tgg etc eta ctt cat gtc ate ctt ctg aca gag gtc aaa gat ttt 48 

Met Trp Leu Leu Leu His Val lie Leu Leu Thr Glu Val Lys Asp Phe 

15 10 15 

gca ctg get gac age agt atg gtg get cct ctg tgc ccc aaa ggg tat 96 

Ala Leu Ala Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr 

20 25 30 

ttt ccc tgt ggg aat etc acc aaa tgc ttg ccc cga gec ttt cac tgc 144 

Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys 

35 40 45 

gat ggt gtg gat gat tgc ggg aat ggt gec gac gag gac aac tgt ggt 192 

Asp Gly Val Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly 

50 55 60 

gac act agt gga tgg aca acc ata ttt ggc aca gtc cat ggg aat gtc 240 

Asp Thr Ser Gly Trp Thr Thr lie Phe Gly Thr Val His Gly Asn Val 

65 70 75 80 

aat aaa gtg aca ttg aca cag gag tgc ttt etc age cag tat cca cag 288 

Asn Lys Val Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin 

85 90 95 

cac tgt tac tgc aga gaa aat gaa ctg gaa tgt gta aag get gac tta 336 

His Cys Tyr Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu 

100 105 110 

aaa get gtg cca aag gtt tec age aac gta aca tta eta tct ctt aag 384 

Lys Ala Val Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys 

115 120 125 

aaa aac aaa ate cac aga ctt cca gtc aag gtc ttc age aga tac aca 432 

Lys Asn Lys lie His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr 

130 135 140 

gaa etc aga aag ata tac ctt cag cac aac tgc ate aca cac ate tec 480 

Glu Leu Arg Lys lie Tyr Leu Gin His Asn Cys lie Thr His lie Ser 

145 150 155 160 

agg aga gca ttc ctt gga tta cat aat eta caa ata ctg tat etc age 528 

Arg Arg Ala Phe Leu Gly Leu His Asn Leu Gin lie Leu Tyr Leu Ser 

165 170 175 

cat aac tgc att acc tct etc agg cct ggg ata ttc aaa gac ttg cat 576 

His Asn Cys lie Thr Ser Leu Arg Pro Gly lie Phe Lys Asp Leu His 

180 185 190 

cag ctt get tgg eta att tta gat gac aac ccg ate acc aga ate tea 624 

Gin Leu Ala Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He Ser 

195 200 205 
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FIG. 8B 



cag aag tec ttt atg ggg tta aac tec ttg ttt ttc ttg tec atg gtg 672 

Gin Lys Ser Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val 

210 215 220 

ggt aac egg etc gag gec ctt cct gaa aca ttg tgt get cag atg cct 720 

Gly Asn Arg Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro 

225 230 235 240 

caa etc aac tgg gtg gat ctg gca aac aat gga ata aag tac ata acg 768 

Gin Leu Asn Trp Val Asp Leu Ala Asn Asn Gly lie Lys Tyr lie Thr 

245 250 255 

aac tec acc ttc eta acg tgc gac teg etc acg gtt ctg ttt ctg cct 816 

Asn Ser Thr Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro 

260 265 270 

aga aat caa att ggt ttt gtt cca gag aag aca ttt tct tea tta aaa 864 

Arg Asn Gin lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys 

275 230 285 

aat tta gga gaa ctg gac ctg tct age aat atg ata aca aaa etc cca 912 

Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Met He Thr Lys Leu Pro 

290 295 300 

gtc cac ctt ttc age gac ctt cat ctt etc cag aag ctg aac ctg tea 960 

Val His Leu Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser 

305 310 315 320 

tec aac cct ctt ctg tat gtc cac aag aac cag ttt gga agt etc aaa 1008 

Ser Asn Pro Leu Leu Tyr Val His Lys Asn Gin Phe Gly Ser Leu Lys 

325 330 335 

caa ctt cag tct eta gac ctg gaa agg ata gag att cca aac ata age 1056 

Gin Leu Gin Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He Ser 

340 345 350 

aca gga atg ttc cag cca atg aag aac ctt tct cac att tat ttg aaa 1104 

Thr Gly Met Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Leu Lys 

355 360 365 

acc ttt cga tac tgc tec tat gtc ccc cat gtc cga ate tgt atg ccg 1152 

Thr Phe Arg Tyr Cys Ser Tyr Val Pro His Val Arg He Cys Met Pro 

370 375 380 

teg act gat ggt att tct teg tct gag gac etc ttg get aac ggt ate 1200 

Ser Thr Asp Gly He Ser Ser Ser Glu Asp Leu Leu Ala Asn Gly He 

385 390 395 400 

etc aga gtg tct gtc tgg gtt ata get ttc att acc tgc gtt ggg aat 1248 

Leu Arg Val Ser Val Trp Val He Ala Phe lie Thr Cys Val Gly Asn 

405 410 415 
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FIG. 8C 



ttc ctt gtc at a gcc gtg aga tct etc att aag get gag aat aca act 12 96 

Phe Leu Val lie Ala Val Arg Ser Leu lie Lys Ala Glu Asn Thr Thr 
420 425 430 

cac get atg tec ate aaa ate ctt tgt tgt gcc gat tgc ctg atg ggg 134 4 

His Ala Met Ser lie Lys lie Leu Cys Cys Ala Asp Cys Leu Met Gly 
435 440 445 

gtg tac ctg ttc tec gtg ggc gtc ttt gac ate aag tac cga ggg cag 1392 

Val Tyr Leu Phe Ser Val Gly Val Phe Asp lie Lys Tyr Arg Gly Gin 
450 455 460 

tat cag aag tat gcg ctg ctg tgg atg gag agt gtg ccc tgc cgc ctg 14 4 0 

Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Pro Cys Arg Leu 

465 470 475 480 

ctg ggc ttc ctg gcc acg ctg tec aca gag gtc teg gtg ctg ctg ctg 1488 

Leu Gly Phe Leu Ala Thr Leu Ser Thr Glu Val Ser Val Leu Leu Leu 
485 490 495 

aca ttc ctg acg ctg gag aag ttc ctt gtc ata gta ttc cct ttc age 1536 

Thr Phe Leu Thr Leu Glu Lys Phe Leu Val He Val Phe Pro Phe Ser 
500 505 510 

aac ctg cgc ctg ggc aag cgc cag act get gtg gcc etc gcc age ate 158 4 

Asn Leu Arg Leu Gly Lys Arg Gin Thr Ala Val Ala Leu Ala Ser He 
515 520 525 

tgg gtg gtg gga ttt etc ata gcg gcc gtt ccg ttc acc aga gag gat 1632 

Trp Val Val Gly Phe Leu He Ala Ala Val Pro Phe Thr Arg Glu Asp 
530 535 540 

tat ttc ggc aac ttt tat ggg aaa aat gga gtc tgc ttc cca ctt cat 1680 

Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu His 

545 550 555 560 

tat gac caa gca gaa gat ttt gga agt aga ggg tac tec ctt ggg att 1728 

Tyr Asp Gin Ala Glu Asp Phe Gly Ser Arg Gly Tyr Ser Leu Gly He 
565 570 575 

ttc eta ggt gtg aac ttg ctg get ttc etc gtc ate gtg att tec tat 1776 

Phe Leu Gly Val Asn Leu Leu Ala Phe Leu Val He Val He Ser Tyr 
580 585 590 

gtc acc atg ttc tgc tec att cat aaa aca gcc ctt cag act gca gaa 1824 

Val Thr Met Phe Cys Ser He His Lys Thr Ala Leu Gin Thr Ala Glu 
595 600 605 

gtg agg age cac ate ggg aag gag gtg get gtt gca aac egg ttc ttt 1872 

Val Arg Ser His He Gly Lys Glu Val Ala Val Ala Asn Arg Phe Phe 
610 615 620 
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FIG. 8D 



ttt ate gtg ttc tct gat gec ate tgc tgg ate cct gtg ttt gtc gtt 1920 

Phe lie Val Phe Ser Asp Ala lie Cys Trp lie Pro Val Phe Val Val 

625 630 635 640 

aag ate ctg tct etc ctt caa gtg gag ata cca ggc aca ate act tec 1968 

Lys lie Leu Ser Leu Leu Gin Val Glu lie Pro Gly Thr lie Thr Ser 

645 650 655 

tgg ate gtg gtt ttt ttc ctt ccg gtg aac age gee tta aac ccc ate 2016 

Trp lie Val Val Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro lie 

660 665 670 

etc tac act ctg acg acc tec ttt ttt aag gac aag ttg aaa cag ttg 2064 

Leu Tyr Thr Leu Thr Thr Ser Phe Phe Lys Asp Lys Leu Lys Gin Leu 

675 680 685 

ctg cac aaa cat egg agg aaa ccc ate ttc aaa gtt aag aag aaa agt 2112 

Leu His Lys His Arg Arg Lys Pro lie Phe Lys Val Lys Lys Lys Ser 

690 695 700 

tta tec get tec att gtg tgg aca gac gag tct tea ctt aaa ctt gga 2160 

Leu Ser Ala Ser lie Val Trp Thr Asp Glu Ser Ser Leu Lys Leu Gly 

705 710 715 720 

gtg ttg age aaa ata gee ctt ggg gac agt ata atg aag ccg gtc tec 2208 

Val Leu Ser Lys lie Ala Leu Gly Asp Ser lie Met Lys Pro Val Ser 

725 730 735 

ccg tag 2214 
Pro 
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FIG. 9A 



gac age agt atg gtg get cct ctg tgc ccc aaa ggg tat ttt ccc tgt 48 

Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr Phe Pro Cys 

15 10 15 

ggg aat etc ace aaa tgc ttg ccc cga gec ttt cac tgc gat ggt gtg 96 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Val 

20 25 30 

gat gat tgc ggg aat ggt gec gac gag gac aac tgt ggt gac act agt 14 4 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly Asp Thr Ser 

35 40 45 

gga tgg aca ace ata ttt ggc aca gtc cat ggg aat gtc aat aaa gtg 192 

Gly Trp Thr Thr lie Phe Gly Thr Val His Gly Asn Val Asn Lys Val 

50 55 60 

aca ttg aca cag gag tgc ttt etc age cag tat cca cag cac tgt tac 240 

Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin His Cys Tyr 

65 70 75 ' 80 

tgc aga gaa aat gaa ctg gaa tgt gta aag get gac tta aaa get gtg 288 

Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu Lys Ala Val 

85 90 95 

cca aag gtt tec age aac gta aca tta eta tct ctt aag aaa aac aaa 336 

Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 

100 105 110 

ate cac aga ctt cca gtc aag gtc ttc age aga tac aca gaa etc aga 384 

lie His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr Glu Leu Arg 

115 120 125 

aag ata tac ctt cag cac aac tgc ate aca cac ate tec agg aga gca 432 

Lys lie Tyr Leu Gin His Asn Cys lie Thr His lie Ser Arg Arg Ala 

130 135 140 

ttc ctt gga tta cat aat eta caa ata ctg tat etc age cat aac tgc 480 

Phe Leu Gly Leu His Asn Leu Gin lie Leu Tyr Leu Ser His Asn Cys 

145 150 155 160 

att acc tct etc agg cct ggg ata ttc aaa gac ttg cat cag ctt get 528 

lie Thr Ser Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Ala 

165 170 175 

tgg eta att tta gat gac aac ccg ate acc aga ate tea cag aag tec 576 

Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Lys Ser 

180 185 190 



ttt atg ggg tta aac tec ttg ttt ttc ttg tec atg gtg ggt aac egg 
Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Gly Asn Arg 
195 200 205 



624 
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FIG. 9B 



etc gag gec 
Leu GIu Ala 
210 

tgg gtg gat 
Trp Val Asp 
225 

ttc eta acg 
Phe Leu Thr 



att ggt ttt 
lie Gly Phe 



gaa ctg gac 
Glu Leu Asp 
275 

ttc age gac 
Phe Ser Asp 
290 

ctt ctg tat 
Leu Leu Tyr 
305 

tct eta gac 
Ser Leu Asp 



ttc cag cca 
Phe Gin Pro 



tac tgc tec 
Tyr Cys Ser 
355 

ggt att tct 
Gly lie Ser 
370 



ctt cct gaa 
Leu Pro Glu 



ctg gca aac 
Leu Ala Asn 
230 

tgc gac teg 
Cys Asp Ser 
245 

gtt cca gag 

Val Pro Glu 
260 

ctg tct age 
Leu Ser Ser 



ctt cat ctt 
Leu His Leu 



gtc cac aag 
Val His Lys 
310 

ctg gaa agg 
Leu Glu Arg 
325 

atg aag aac 
Met Lys Asn 
340 

tat gtc ccc 
Tyr Val Pro 



teg tct gag 
Ser Ser Glu 



aca ttg tgt 
Thr Leu Cys 
215 

aat gga ata 
Asn Gly lie 



etc acg gtt 
Leu Thr Val 



aag aca ttt 

Lys Thr Phe 
265 

aat atg ata 
Asn Met lie 

280 

etc cag aag 
Leu Gin Lys 
295 

aac cag ttt 
Asn Gin Phe 



ata gag att 
lie Glu lie 



ctt tct cac 
Leu Ser His 
345 

cat gtc cga 
His Val Arg 
360 

gac etc ttg 
Asp Leu Leu 
375 



get cag atg 
Ala Gin Met 
220 

aag tac ata 
Lys Tyr lie 
235 

ctg ttt ctg 
Leu Phe Leu 
250 

tct tea tta 
Ser Ser Leu 



aca aaa etc 
Thr Lys Leu 



ctg aac ctg 
Leu Asn Leu 
300 

gga agt etc 
Gly Ser Leu 
315 

cca aac ata 
Pro Asn lie 
330 

att tat ttg 
lie Tyr Leu 



ate tgt atg 
lie Cys Met 



get aac ggt 
Ala Asn Gly 
380 



cct caa etc 
Pro Gin Leu 



acg aac tec 
Thr Asn Ser 



cct aga aat 
Pro Arg Asn 

255 

aaa aat tta 
Lys Asn Leu 

270 

cca gtc cac 
Pro Val His 
285 

tea tec aac 
Ser Ser Asn 



aaa caa ctt 
Lys Gin Leu 



age aca gga 
Ser Thr Gly 
335 

aaa acc ttt 
Lys Thr Phe 
350 

ccg teg act 
Pro Ser Thr 
365 



aac 672 
Asn 



acc 720 

Thr 

240 

caa 768 
Gin 



gga 816 
Gly 



ctt 864 
Leu 



cct 912 
Pro 



cag 960 

Gin 

320 

atg 1008 
Met 



cga 1056 
Arg 



gat 1104 
Asp 



1140 
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FIG. 10A 



1 MI VFLVFKHLFSLRLXTMFFLLHFIVL I NVKDFALTQGSMIT PS CQKGYF 50 

.1:11 II. I : I II! 
1 MTSGSVFFYILIFGKYFSHGGGQDV . . KCSLGYF 32 



51 PCGNLTKCLPRAFHCDGKDDCGNGADEENCGDTSGWATI FGTVHGNANSV 100 

1111:11111. 11 . 1 I M 1 1 I I I : I I I I . I 1 - I 
33 PCGNITKCLPQLLHCNGVDDCGNQADEDNCGDNNGWSMQFDKYFASYYKM 8 2 



101 ALTQECFLKQYPQCCDCKETELECVNGDLKSVPMI SNNVTLLS 14 3 

I I 1 I - I II. 1 I : I . I : - I I : I . I I I : I 
8 3 TSQYPFEAETPECLVGSVPVQCLCQGLELDCDETNLRAVPSVSSNVTAMS 132 



14 4 LKKNKIHSLPDKVFIKYTKLKKIFLQHNCIRHISRKAFFGLCNLQILYLN 193 

I . I I II ! I i . I : : M . I I I I II II • I IN. 
133 LQWNLIRKLPPDCFKNYHDLQKLYLQNNKITSISIYAFRGLMSLTKLYLS 182 



194 HNCITTLRPGIFKDLHQLTWLILDDNPITRISQRLFTGLNSLFFLSMVNN 24 3 

M II I : I I : I . I I I . I I I I : : I I : . I I I 1 Mill I : . I I 
183 HNRITFLKPGVFEDLHRLEVJLI IEDNHLSRISPPTFYGLNSLILLVLMNN 232 



24 4 YLEALP. KQMCAQMPQLNWVDLEGNRIKYLTNSTFLSCDSLTVLFLPRNQ 2 92 

I 111:1 I 1 . I - I • I I I 1 I I I I I I : I I . I I 1 I : : I ■ 
233 VLTRLPDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNK 282 



2 93 IGFVPEKTFSSLKNLGELDLSSNTITELSPHLFKDLKLLQKLNLSSNPLM 34 2 

I . I I I - I . I I I ! I I I I I I : 1 I I 1 I I . 1 I I I I I : 
28 3 INHLNENTFAPLQKLDELDLGSKKIENLPPLIFKDLKELSQLNLSYNPIQ 332 



34 3 YLHKNQFESLKQLQSLDLERIEIPNINTRMFQPMKNLSHIYFKNFRYCSY 392 

: Ml: I .(Ml M Mi II II ( M : I I M 1 I II (Mi i 
333 KIQANQFDYLVKLKSLSLEGIEISNIQQRMFRPLMNLSHI YFKKFQYCGY 382 



393 APHVRICMPLTDGISSFEDLLANNILRIFVWVIAFITCFGNLFVIGMRSF 4 42 

Mill I I II M M Mill. I Mill:. M II II M I 1 II : 
38 3 APHVRSCKPNTDGISSLENLLASIIQRVFVWWSAVTCFGNIFVICMRPY 432 



44 3 I KAENTT HAMS I KI LCCADCLMGVYLFFVG I FDI KYRGQ YQKYALLWMES 4 92 

I : M I Mill I M II I I M M I I M M M : M M Ml II I M 
433 I RSENKLYAMS I I SLCCADCLMGI YLFVIGGFDLKFRGEYNKHAQLWMES 482 



4 93 VQCRLMGFLAMLSTEVSVLLLTYLTLEKFLVIVFPFSNIRPGKRQTSVIL 54 2 

I M M I 1 M M II II M 11 M M I I: : M M I Mill - 1 M 
483 THCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVL 532 
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FIG. 10B 



54 3 ICIWMAGFLIAVI PFWNKDYFGN FYGKNGVCF PLY YDQTE DIGS KG YSLG 592 

I i I . M : : I i i I i : : I 1:11 I ! ! II 1 [ : : I I I I ■ • I I . 
533 ILIWITGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVA 582 



593 IFLGVNLLAFLIIVFSYITMFCSIQKTALQTTEVRNCFGREVAVANRFFF 642 

I 1 I I : ! 1 11:111111 -II I : ■ . 1 : 11:11 : 1 . M I I I I 
583 I FLGINLAAFI II VFSYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFF 632 



64 3 IVFSDAICWIPVFWKILSLFRVEIPDTMTSWIVIFFLPVNSALNPILYT 692 

I ! I . I! : I t M : 1 1 I I I I I -llll 1.111:111 I 1 : I i I I 1 I I M 1 
633 IVFTDALCWIPIFWKFLSLLQVEIPGTITSWVVIFILPINSALNPILYT 682 



693 LTTNFFKDKLKQLLHKH . QRKSI FKIKKKSLSTSIVWIEDSSSLKLGVLN 741 

ill II::. : : MM. M . . t : I : I : ! 

683 LTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSFIWVE MWPLQ 726 



742 KITLGDSIMKPVS 754 

. : MM 
7 27 E. . MPPELMKPDLFTYPCEMSLISQSTRLNSYS 7 57 
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FIG. 11A 



1 QGSMITPSCQKGYFPCGNLTKCLPRAFHCDGKDDCGNGADEENCGDTSGW 50 

II: 1 1 I i I I I I 1 1 i M 1 I M 1 M I ! I M I I I 1 I I 1 : I I i I M 1 I 
1 DSS^APLCPKGYFPCGNLTKCLPRAFHCDGVDDCGNGADEDNCGDTSGW 5 0 



51 ATIFGTVHGNANSVALTQECFLKQYPQCCDCKETELECVNGDLKSVPMIS 100 

I M I 1 ! I II I I 1 I M M I MM I MM I M M M M M M 
51 TTIFGTVHGNVNKVTLTQECFLSQYPQHCYCRENELECVKADLKAVPKVS 100 



101 NNVTLLSLKKNKIHSLPDKVFIKYTKLKKIFLQHNCIRHISRKAFFGLCN 150 

M M M M M M M M Ml M M M M M M M I M I I M I M 1 
101 SNVTLLSLKKNKIHRLPVKVFSRYTELRKI YLQHNCITHISRRAFLGLHN 150 



151 LQILYLNHNCITTLRPGIFKDLHQLTWLILDDNPITRISQRLFTGLNSLF 200 

M M I M M M I M M M M 1 M M I I I I I I I I I ! II M : I M I M I 
151 LQILYLSHNCITSLRPGIFKDLHQLAWLILDDNPITRISQKS FMGLNSLF 200 



201 FLSMVNNYLEALPKQMCAQMPQLNWVDLEGNRIKYLTNSTFLSCDSLTVL 250 

M M I I M ! M . M M M M M M I I I I I : I I I I I I • I I I I I I I 
201 FLSMVGNRLEALPETLCAQMPQLNWVDLANNGIKYITNSTFLTCDSLTVL 25 0 



251 FLPRNQIGFVPEKTFSSLKNLGELDLSSNTITELSPHLFKDLKLLQKLNL 30 0 

M I M M M M 1 M t M M M M M M M t M 1 Ml II II M 1 M 
251 FLPRNQIGFVPEKTFSSLKNLGELDLSSNMITKLPVHLFSDLHLLQKLNL 300 



301 SSNPLMYLHKNQFESLKQLQSLDLERIEIPNINTRMFQPMKNLSHIYFKN 350 

M I M M M I I II I 1 I M II 1 II I I I I II I M I M M M II M I 1 I 
301 SSNPLLYVHKNQFGSLKQLQSLDLERIEIPNISTGMFQPMKNLSHIYLKT 350 



351 FRYCSYAPHVRICMPLTDGISSFEDLLANNILRIFVWVIAFITCFGNLFV 4 00 

M M II I M I I I I I 1 II M I M II M MM 1 M I M II I M I 
351 FRYCSYVPHVRICMPSTDGISSSEDLLANGILRVSVWVIAFITCVGNFLV 4 00 



401 IGMRS FIKAENTTHAMS I KILCCADCLMGVYLFFVGI FDI KYRGQYQKYA 450 

I Ml M M II M I M M M II I M I M I M 1 M M II i II I M M 1 I 
401 IAVRSLIKAENTTHAMSIKILCCADCLMGVYLFSVGVFDI KYRGQYQKYA 4 50 



451 LLWMESVQCRLMGFLAMLSTEVSVLLLTYLTLEKFLVIVFPFSNIRPGKR 500 

II I I! I I II I M I M M II M M M I M M II 11 M II M M M III 
451 LLWMESVPCRLLGFLATLSTEVSVLLLTFLTLEKFLVIVFPFSNLRLGKR 500 



501 QTSVILICIWMAGFLIAVIPFWNKDYFGNFYGKNGVCFPLYYDQTEDIGS 5 50 

MM I 1M Mill Ml M II M I M II II I I M M M 11 I I 
501 QTAVALASIWWGFLIAAVPFTREDYFGNFYGKNGVCFPLHYDQAEDFGS 5 50 
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FIG. 11B 

551 KGYSLGIFLGVNLLAFLIIVFSYITMFCSIQKTALQTTEVRNCFGREVAV 600 

: I I i M II I i I I I I M I : II I I : M I I I I MINI III. I : ! I i I 
551 RGYSLGI FLGVNLLAFLVIVISYVTMFCSIHKTALQTAEVRSHIGKEVAV 600 

601 ANRFFFIVFSDAICWIPVFVVKILSLFRVEIPDTMTSWIVIFFLPVNSAL 650 

I I II I I I I I I II t I i II I I II 1 i I I I .1111 I . I I I I I : I I I I I I 1 I I 
601 ANRFFFIVFSDAICWIPVFVVKILSLLQVEI PGTITSWIVVFFLPVNSAL 650 

651 NPILYTLTTNFFKDKLKQLLHKHQRKSIFKIKKKSLSTSIVWIEDSSSLK 700 

IIIIIIMI.IIMMIIIMII.il I! I : I M M 1 I M I I MM 
651 NPILYTLTTSFFKDKLKQLLHKHRRKPI FKVKKKSLSASIVW . TDESSLK 699 

701 LGVLNKITLGDSIMKPVS . . 718 

I II I . I I I I! I M M I ! 
7 00 LGVLSKIALGDS IMKPVS P 718 
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FIG. 12 



1 QGSMITPSCQKGYFPCGNLTKCLPRAFHCDGKDDCGNGADEEISICGDTSGW 50 

II: I I I I I I i i M I II I i ! I I I I M i i I i I I I I I i : I i I I I i I I 
1 DSSMVAPLCPKGYFPCGNLTKCLPRAFHCDGVDDCGNGADEDNCGDTSGW 50 



51 ATIFGTVHGNANSVALTQECFLKQYPQCCDCKETELECVNGDLKSVPMIS 100 

i I I i I I 1 1 I i 1 I I I 1 1 I 1 Mil I 1:1 II I I I 111.11 : I 
51 TTIFGTVHGNVNKVTLTQECFLSQYPQHCYCRENELECVKADLKAVPKVS 100 



101 NNVTLLSLKKNKIHSLPDKVFIKYTKLKKIFLQHNCIRHISRKAFFGLCN 150 

. M f 11 I I I I I I I I II Ml : M M : II : i M I 11 II I I : I 1 II I 
101 SNVTLLSLKKNKIHRLPVKVFSRYTELRKIYLQHNCITHISRRAFLGLHN 150 



151 LQILYLNHNCITTLRPGIFKDLHQLTWLILDDNPITRISQRLFTGLNSLF 200 

j I I 1 M • II M I . I I I M II 1 I M I I I I I I I I I 11 I M I : 1 MINI 
151 LQILYLSHNCITSLRPGIFKDLHQLAWLILDDNPITRISQKSFMGLNSLF 200 



201 FLSMVNNYLEALPKQMCAQMPQLNWVDLEGNRIKYLTNSTFLSCDSLTVL 250 

M M I ( M M I . : I I I I I I I M II I I I II : I I I M I ■ I M I I I I 
201 FLSMVGNRLEALPETLCAQMPQLNWVDLANNGIKYITNSTFLTCDSLTVL 250 



251 FLPRNQIGFVPEKTFSSLKNLGELDLSSNTITELSPHLFKDLKLLQKLNL 300 

I II M I II I I I I I M I II I I I I II I M I I 1 M I III II I 1 II 1 I I 
251 FLPRNQIGFVPEKTFSSLKNLGELDLSSNMITKLPVHLFSDLHLLQKLNL 300 



301 SSNPLMYLHKNQFESLKQLQSLDLERIEIPNINTRMFQPMKNLSHIYFKN 350 

I I II i : I . I I I I I I I II I M II I I M I M I I . I M I I M I II M I I 
301 SSNPLLYVHKMQFGSLKQLQSLDLERIEIPNISTGMFQPMKNLSHIYLKT 350 



351 FRYCSYAPHVRICMPLTDGISSFEDLLANNILR 383 

M II 11 I 1 I 1 ! I M M 11 M II ! I I I 
351 FRYCSYVPHVRICMPSTDGISSSEDLLANG . . . 380 
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SEQUENCE LISTING 

<110> Paszty, Christopher J. 
Gong, Jianhua 
Daugherty , Betsy 
Rogers , Norma 

<120> Leucine-Rich G Protein Coupled Receptor-8 Molecules and 
Uses Thereof 

<130> 00-1229-A 

<140> 
<141> 

<150> 60/224, 455 
<151> 2000-08-10 

<160> 42 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2265 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2262) 

<220> 

<221> sig__peptide 
<222> (1) . . (108) 

<400> 1 

atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 4 8 
Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 
15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 
Thr Met Phe Phe Leu Leu His Phe He Val Leu He Asn Val Lys Asp 
20 25 30 

ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 14 4 
Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 
35 40 45 

tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 192 
Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 240 
Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 
65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 
Gly Asp Thr Ser Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn 
85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 



1 
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Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 

100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 

115 120 125 



tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 
Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 



gtt aat aac tac tta gaa get ctt ccc aag cag atg tgt gec caa atg 
Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 
245 250 255 



cct aga aat caa att ggt ttt gtt cca gag aag aca ttt tct tea tta 
Pro Arg Asn Gin lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu 
290 295 300 



432 



aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 

145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 576 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 
180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 
195 200 205 

cat cag eta act tgg eta att eta gat gac aat cca ata acc aga att 672 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie 

210 215 220 

tea cag cgc ttg ttt acg gga tta aat tec ttg ttt ttc ctg tct atg 720 

Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 

225 230 235 240 



768 



cct caa etc aac tgg gtg gat ttg gaa ggc aat aga ata aag tat etc 816 

Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu 
260 265 270 

aca aat tct acg ttt ctg teg tgc gat teg etc aca gtg ctg ttt ctg 864 

Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu 
275 280 285 



912 



aaa aat tta gga gaa ctg gat ctg tct age aat acg ata acg gag eta 960 

Lys Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu 

305 310 315 320 

tea cct cac ctt ttt aaa gac ttg aag ctt eta caa aag ctg aac ctg 1008 

Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu 

325 330 335 

tea tec aat cct ctt atg tat ctt cac aag aac cag ttt gaa agt ctt 1056 

Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu 
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340 345 350 

aaa caa ctt cag tct eta gac ctg gaa agg ata gag att cca aat ata 1104 

Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He 

355 360 365 

aac aca cga atg ttt caa ccc atg aag aat ctt tct cac att tat ttc 1152 

Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser His lie Tyr Phe 

370 375 380 



aaa aac ttt cga tac tgc tec tat get ccc cat gtc cga ata tgt atg 
Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val Arg He Cys Met 
385 390 395 400 



ate etc aga ata ttt gtc tgg gtt ata get ttc att acc tgc ttt gga 
He Leu Arg lie Phe Val Trp Val He Ala Phe He Thr Cys Phe Gly 
420 425 430 



1200 



ccc ttg acg gac ggc att tct tea ttt gag gac etc ttg get aac aat 1248 
Pro Leu Thr Asp Gly He Ser Ser Phe Glu Asp Leu Leu Ala Asn Asn 
405 410 415 



1296 



aat ctt ttt gtc att ggc atg aga tct ttc att aaa get gaa aat aca 1344 
Asn Leu Phe Val lie Gly Met Arg Ser Phe lie Lys Ala Glu Asn Thr 
435 440 445 

act cac get atg tec ate aaa ate ctt tgt tgt get gat tgc ctg atg 1392 
Thr His Ala Met Ser He Lys He Leu Cys Cys Ala Asp Cys Leu Met 
450 455 460 

ggt gtt tac ttg ttc ttt gtt ggc att ttc gat ata aaa tac cga ggg 1440 
Gly Val Tyr Leu Phe Phe Val Gly He Phe Asp He Lys Tyr Arg Gly 
465 470 475 480 

cag tat cag aag tat gee ttg ctg tgg atg gag age gtg cag tgc cgc 14 88 
Gin Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Gin Cys Arg 
485 490 495 

etc atg ggg ttc ctg gee atg ctg tec acc gaa gtc tct gtt ctg eta 1536 
Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val Ser Val Leu Leu 
500 505 510 

ctg acc tac ttg act ttg gag aag ttc ctg gtc att gtc ttc ccc ttc 1584 
Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val lie Val Phe Pro Phe 
515 520 525 

agt aac att cga cct gga aaa egg cag acc tea gtc ate etc att tgc 1632 
Ser Asn He Arg Pro Gly Lys Arg Gin Thr Ser Val He Leu lie Cys 
530 535 540 

ate tgg atg gcg gga ttt tta ata get gta att cca ttt tgg aat aag 1680 
He Trp Met Ala Gly Phe Leu lie Ala Val He Pro Phe Trp Asn Lys 
545 550 555 560 

gat tat ttt gga aac ttt tat ggg aaa aat gga gta tgt ttc cca ctt 1728 
Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu 
565 570 575 

tat tat gac caa aca gaa gat att gga age aaa ggg tat tct ctt gga 1776 
Tyr Tyr Asp Gin Thr Glu Asp He Gly Ser Lys Gly Tyr Ser Leu Gly 
580 585 590 
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att ttc eta ggt gtg aac ttg ctg get ttt etc ate att gtg ttt tec 1824 

lie Phe Leu Gly Val Asn Leu Leu Ala Phe Leu lie lie Val Phe Ser 

595 600 605 

tat att act atg ttc tgt tec att caa aaa ace gee ttg cag ace aca 1872 

Tyr lie Thr Met Phe Cys Ser He Gin Lys Thr Ala Leu Gin Thr Thr 

610 615 620 

gaa gta agg aat tgt ttt gga aga gag gtg get gtt gca aat cgt ttc 1920 

Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg Phe 

625 630 635 640 

ttt ttt ata gtg ttc tct gat gec ate tgc tgg att cct gta ttt gta 1968 

Phe Phe He Val Phe Ser Asp Ala He Cys Trp He Pro Val Phe Val 

645 650 655 

gtt aaa ate ctt tec etc ttc egg gtg gaa ata cca gac aca atg act 2016 

Val Lys He Leu Ser Leu Phe Arg Val Glu He Pro Asp Thr Met Thr 

660 665 670 

tec tgg ata gtg att ttt ttc ctt cca gtt aac agt get ttg aat cca 2064 

Ser Trp He Val He Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro 

675 680 685 

ate etc tat act etc aca ace aac ttt ttt aag gac aag ttg aaa cag 2112 

He Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp Lys Leu Lys Gin 

690 695 700 

ctg ctg cac aaa cat cag agg aaa tea att ttc aaa att aaa aaa aaa 2160 

Leu Leu His Lys His Gin Arg Lys Ser He Phe Lys He Lys Lys Lys 

705 710 715 720 

agt tta tct aca tec att gtg tgg ata gag gac tec tct tec ctg aaa 2208 

Ser Leu Ser Thr Ser lie Val Trp lie Glu Asp Ser Ser Ser Leu Lys 

725 730 735 

ctt ggg gtt ttg aac aaa ata aca ctt gga gac agt ata atg aaa cca 2256 

Leu Gly Val Leu Asn Lys He Thr Leu Gly Asp Ser He Met Lys Pro 

740 745 750 

gtt tec tag 2265 
Val Ser 



<210> 2 
<211> 754 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 
15 10 15 

Thr Met Phe Phe Leu Leu His Phe He Val Leu lie Asn Val Lys Asp 

20 25 30 

Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 
35 40 45 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 
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50 55 60 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 
65 70 75 80 

Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 
85 90 95 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 

Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 
145 150 155 160 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 
180 185 190 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 
195 200 205 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie 
210 215 220 

Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 
225 230 235 240 

Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 
245 250 255 

Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu 
260 265 270 

Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu 

275 280 285 

Pro Arg Asn Gin lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu 

290 295 300 

Lys Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Thr He Thr Glu Leu 
305 310 315 320 

Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu 
325 330 335 

Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu 
340 345 350 

Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He 
355 360 365 

Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Phe 
370 375 380 
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Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val Arg He Cys Met 
385 390 395 400 

Pro Leu Thr Asp Gly lie Ser Ser Phe GIu Asp Leu Leu Ala Asn Asn 
405 410 415 

He Leu Arg He Phe Val Trp Val He Ala Phe He Thr Cys Phe Gly 
420 425 430 

Asn Leu Phe Val He Gly Met Arg Ser Phe He Lys Ala Glu Asn Thr 
435 440 445 

Thr His Ala Met Ser He Lys He Leu Cys Cys Ala Asp Cys Leu Met 

450 455 460 

Gly Val Tyr Leu Phe Phe Val Gly He Phe Asp lie Lys Tyr Arg Gly 
465 470 475 480 

Gin Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Gin Cys Arg 
485 490 495 

Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val Ser Val Leu Leu 
500 505 510 

Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val He Val Phe Pro Phe 
515 520 525 

Ser Asn He Arg Pro Gly Lys Arg Gin Thr Ser Val He Leu He Cys 
530 535 540 

He Trp Met Ala Gly Phe Leu He Ala Val He Pro Phe Trp Asn Lys 
545 550 555 560 

Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu 
565 570 575 

Tyr Tyr Asp Gin Thr Glu Asp He Gly Ser Lys Gly Tyr Ser Leu Gly 
580 585 590 

He Phe Leu Gly Val Asn Leu Leu Ala Phe Leu He He Val Phe Ser 
595 600 605 

Tyr lie Thr Met Phe Cys Ser lie Gin Lys Thr Ala Leu Gin Thr Thr 
610 615 620 

Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg Phe 
625 630 635 640 

Phe Phe He Val Phe Ser Asp Ala He Cys Trp He Pro Val Phe Val 
645 650 655 

Val Lys He Leu Ser Leu Phe Arg Val Glu lie Pro Asp Thr Met Thr 
660 665 670 

Ser Trp He Val He Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro 

675 680 685 

lie Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp Lys Leu Lys Gin 
690 695 700 
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Leu Leu His Lys 
705 

Ser Leu Ser Thr 

Leu Gly Val Leu 
740 

Val Ser 



His Gin Arg Lys 
710 

Ser lie Val Trp 

725 

Asn Lys lie Thr 



Ser lie Phe Lys 
715 

lie Glu Asp Ser 

730 

Leu Gly Asp Ser 
745 



lie Lys Lys Lys 
720 

Ser Ser Leu Lys 
735 

lie Met Lys Pro 
750 



<210> 3 
<211> 718 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 

50 55 60 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 
115 120 125 

Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

He Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

Trp Leu He Leu Asp Asp Asn Pro lie Thr Arg He Ser Gin Arg Leu 
180 185 190 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
195 200 205 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 
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Trp Val Asp Leu 

225 

Phe Leu Ser Cys 



lie Gly Phe Val 
260 

Glu Leu Asp Leu 
275 

Phe Lys Asp Leu 
290 

Leu Met Tyr Leu 
305 

Ser Leu Asp Leu 



Phe Gin Pro Met 
340 

Tyr Cys Ser Tyr 
355 

Gly lie Ser Ser 
370 

Phe Val Trp Val 
385 

lie Gly Met Arg 



Ser lie Lys lie 
420 

Phe Phe Val Gly 
435 

Tyr Ala Leu Leu 
450 

Leu Ala Met Leu 

465 

Thr Leu Glu Lys 



Pro Gly Lys Arg 

500 

Gly Phe Leu He 
515 

Asn Phe Tyr Gly 
530 

Thr Glu Asp He 



Glu Gly Asn Arg 

230 

Asp Ser Leu Thr 
245 

Pro Glu Lys Thr 



Ser Ser Asn Thr 
280 

Lys Leu Leu Gin 
295 

His Lys Asn Gin 
310 

Glu Arg He Glu 
325 

Lys Asn Leu Ser 



Ala Pro His Val 
360 

Phe Glu Asp Leu 
375 

He Ala Phe He 
390 

Ser Phe He Lys 
405 

Leu Cys Cys Ala 



He Phe Asp He 
440 

Trp Met Glu Ser 
455 

Ser Thr Glu Val 
470 

Phe Leu Val lie 
485 

Gin Thr Ser Val 



Ala Val He Pro 
520 

Lys Asn Gly Val 
535 

Gly Ser Lys Gly 



He Lys Tyr Leu 

235 

Val Leu Phe Leu 

250 

Phe Ser Ser Leu 
265 

He Thr Glu Leu 



Lys Leu Asn Leu 

300 

Phe Glu Ser Leu 
315 

He Pro Asn He 
330 

His He Tyr Phe 
345 

Arg He Cys Met 



Leu Ala Asn Asn 

380 

Thr Cys Phe Gly 
395 

Ala Glu Asn Thr 
410 

Asp Cys Leu Met 
425 

Lys Tyr Arg Gly 



Val Gin Cys Arg 
460 

Ser Val Leu Leu 
475 

Val Phe Pro Phe 
490 

He Leu He Cys 
505 

Phe Trp Asn Lys 



Cys Phe Pro Leu 
540 

Tyr Ser Leu Gly 



Thr Asn Ser Thr 
240 

Pro Arg Asn Gin 
255 

Lys Asn Leu Gly 
270 

Ser Pro His Leu 
285 

Ser Ser Asn Pro 



Lys Gin Leu Gin 

320 

Asn Thr Arg Met 
335 

Lys Asn Phe Arg 
350 

Pro Leu Thr Asp 

365 

He Leu Arg He 



Asn Leu Phe Val 
400 

Thr His Ala Met 
415 

Gly Val Tyr Leu 
430 

Gin Tyr Gin Lys 
445 

Leu Met Gly Phe 



Leu Thr Tyr Leu 
480 

Ser Asn He Arg 
495 

He Trp Met Ala 

510 

Asp Tyr Phe Gly 
525 

Tyr Tyr Asp Gin 



He Phe Leu Gly 
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545 550 555 560 

Val Asn Leu Leu Ala Phe Leu lie lie Val Phe Ser Tyr He Thr Met 
565 570 575 

Phe Cys Ser He Gin Lys Thr Ala Leu Gin Thr Thr Giu Val Arg Asn 
580 585 590 

Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg Phe Phe Phe lie Val 
595 600 605 

Phe Ser Asp Ala He Cys Trp He Pro Val Phe Val Val Lys He Leu 
610 615 620 

Ser Leu Phe Arg Val Glu He Pro Asp Thr Met Thr Ser Trp He Val 
625 630 635 640 

He Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro lie Leu Tyr Thr 
645 650 655 

Leu Thr Thr Asn Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys 
660 665 670 

His Gin Arg Lys Ser He Phe Lys He Lys Lys Lys Ser Leu Ser Thr 

675 680 685 

Ser He Val Trp He Glu Asp Ser Ser Ser Leu Lys Leu Gly Val Leu 
690 695 700 

Asn Lys He Thr Leu Gly Asp Ser He Met Lys Pro Val Ser 
705 710 715 



<210> 4 
<211> 1149 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1149) 

<400> 4 

caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 4 8 

Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 

15 10 15 

ggg aat ctt acc aag tgc tta ccc cga get ttt cac tgt gat ggc aag 96 
Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 

20 25 30 

gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 14 4 
Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

gga tgg gcg acc ata ttt ggc aca gtg cat gga aat get aac age gtg 192 
Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 
Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
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65 70 75 80 

tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 

85 90 95 



ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 
lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 
115 120 125 



288 



ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 336 
Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 



384 



aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 432 
Lys lie Phe Leu Gin His Asn Cys lie Arg His lie Ser Arg Lys Ala 
130 135 140 

ttt ttt gga tta tgt aat ctg caa ata tta tat etc aac cac aac tgc 480 
Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

ate aca acc etc aga cct gga ata ttc aaa gac tta cat cag eta act 528 
lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

tgg eta att eta gat gac aat cca ata acc aga att tea cag cgc ttg 576 
Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Arg Leu 
180 185 190 

ttt acg gga tta aat tec ttg ttt ttc ctg tct atg gtt aat aac tac 624 
Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
195 200 205 

tta gaa get ctt ccc aag cag atg tgt gee caa atg cct caa etc aac 672 
Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

tgg gtg gat ttg gaa ggc aat aga ata aag tat etc aca aat tct acg 720 
Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 
225 230 235 240 

ttt ctg teg tgc gat teg etc aca gtg ctg ttt ctg cct aga aat caa 768 
Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 
245 250 255 

att ggt ttt gtt cca gag aag aca ttt tct tea tta aaa aat tta gga 816 
lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 
260 265 270 

gaa ctg gat ctg tct age aat acg ata acg gag eta tea cct cac ctt 8 64 
Glu Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu 
275 280 285 

ttt aaa gac ttg aag ctt eta caa aag ctg aac ctg tea tec aat cct 912 
Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 
290 295 300 

ctt atg tat ctt cac aag aac cag ttt gaa agt ctt aaa caa ctt cag 960 
Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu Lys Gin Leu Gin 

305 310 315 320 
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tct eta gac ctg gaa agg at a gag att cca aat at a aac aca cga atg 
Ser Leu Asp Leu Glu Arg lie Glu lie Pro Asn lie Asn Thr Arg Met 
325 330 335 



1008 



trt caa ccc atg aag aat ctt tct cac att tat ttc aaa aac ttt cga 
Phe Gin Pro Met Lys Asn Leu Ser His lie Tyr Phe Lys Asn Phe Arg 
340 345 350 



1056 



tac tgc tec tat get ccc cat gtc cga ata tgt atg ccc ttg acg gac 
Tyr Cys Ser Tyr Ala Pro His Val Arg lie Cys Met Pro Leu Thr Asp 
355 360 365 



1104 



ggc att tct tea ttt gag gac etc ttg get aac aat ate etc aga 
Gly lie Ser Ser Phe Glu Asp Leu Leu Ala Asn Asn lie Leu Arg 
370 375 380 



1149 



<210> 5 
<211> 383 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 

15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 

85 90 95 

Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 

115 120 125 

Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu Asn His Asn Cys 
115 150 155 160 

He Thr Thr Leu Arg Pro Gly He Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He Ser Gin Arg Leu 
180 185 190 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
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195 200 205 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 
225 230 235 240 

Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 
245 250 255 

lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 
260 265 270 

Glu Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu 

275 280 285 

Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 
290 295 300 

Leu Met Tyr Leu His Lys Asn Gin Phe Glu Ser Leu Lys Gin Leu Gin 

305 310 315 320 

Ser Leu Asp Leu Glu Arg lie Glu lie Pro Asn lie Asn Thr Arg Met 
325 330 335 

Phe Gin Pro Met Lys Asn Leu Ser His lie Tyr Phe Lys Asn Phe Arg 
340 345 350 

Tyr Cys Ser Tyr Ala Pro His Val Arg lie Cys Met Pro Leu Thr Asp 
355 360 365 

Gly lie Ser Ser Phe Glu Asp Leu Leu Ala Asn Asn lie Leu Arg 

370 375 380 



<210> 6 

<211> 2193 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2190) 

<220> 

<221> sig__peptide 
<222> (1) . . (108) 



<400> 6 

atg att gtt ttt ctg gtt ttt aaa 

Met lie Val Phe Leu Val Phe Lys 

1 5 

aca atg ttc ttt eta ctt cat ttc 

Thr Met Phe Phe Leu Leu His Phe 
20 



cat etc ttc age etc aga ttg att 48 
His Leu Phe Ser Leu Arg Leu lie 

10 15 

ate gtt ctg ate aat gtc aaa gat 96 
lie Val Leu lie Asn Val Lys Asp 
25 30 



ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 144 
Phe Ala Leu Thr Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly 
35 40 45 
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tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 192 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 240 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 

ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 

Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 

85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 

100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 

115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 

Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 

130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 

145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 

165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 576 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 

180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 

195 200 205 

cat cag eta act tgg eta att eta gat gac aat cca ata acc aga att 672 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie 

210 215 220 

tea cag cgc ttg ttt acg gga tta aat tec ttg ttt ttc ctg tct atg 720 

Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 

225 230 235 240 

gtt aat aac tac tta gaa get ctt ccc aag cag atg tgt gee caa atg 768 

Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 

245 250 255 

cct caa etc aac tgg gtg gat ttg gaa ggc aat aga ata aag tat etc 816 

Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu 

260 265 270 

aca aat tct acg ttt ctg teg tgc gat teg etc aca gtg ctg gat ctg 864 

Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu 

275 280 285 
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tct age aat 
Ser Ser Asn 
290 

aag ctt eta 
Lys Leu Leu 

305 

cac aag aac 
His Lys Asn 



gaa agg ata 
Glu Arg lie 



aag aat ctt 
Lys Asn Leu 
355 

get ccc cat 
Ala Pro His 

370 

ttt gag gac 
Phe Glu Asp 
385 

ata get ttc 
He Ala Phe 



tct ttc att 
Ser Phe He 



ctt tgt tgt 
Leu Cys Cys 
435 

att ttc gat 
He Phe Asp 
450 

tgg atg gag 
Trp Met Glu 
465 

tec acc gaa 
Ser Thr Glu 



ttc ctg gtc 
Phe Leu Val 



cag acc tea 
Gin Thr Ser 
515 

get gta att 



acg ata acg 
Thr He Thr 



caa aag ctg 
Gin Lys Leu 
310 

cag ttt gaa 
Gin Phe Glu 
325 

gag att cca 
Glu lie Pro 
340 

tct cac att 
Ser His He 



gtc cga ata 
Val Arg He 



etc ttg get 
Leu Leu Ala 
390 

att acc tgc 
He Thr Cys 
405 

aaa get gaa 
Lys Ala Glu 
420 

get gat tgc 
Ala Asp Cys 



ata aaa tac 
He Lys Tyr 



age gtg cag 
Ser Val Gin 
470 

gtc tct gtt 
Val Ser Val 
485 

att gtc ttc 
He Val Phe 
500 

gtc ate etc 
Val lie Leu 



cca ttt tgg 



gag eta tea 
Glu Leu Ser 
295 

aac ctg tea 
Asn Leu Ser 



agt ctt aaa 
Ser Leu Lys 



aat ata aac 
Asn He Asn 
345 

tat ttc aaa 
Tyr Phe Lys 
360 

tgt atg ccc 
Cys Met Pro 
375 

aac aat ate 
Asn Asn He 



ttt gga aat 
Phe Gly Asn 



aat aca act 
Asn Thr Thr 
425 

ctg atg ggt 
Leu Met Gly 
440 

cga ggg cag 
Arg Gly Gin 
455 

tgc cgc etc 
Cys Arg Leu 



ctg eta ctg 
Leu Leu Leu 



ccc ttc agt 
Pro Phe Ser 
505 

att tgc ate 
He Cys lie 
520 

aat aag gat 



cct cac ctt 
Pro His Leu 
300 

tec aat cct 
Ser Asn Pro 
315 

caa ctt cag 
Gin Leu Gin 
330 

aca cga atg 
Thr Arg Met 



aac ttt cga 
Asn Phe Arg 



ttg acg gac 
Leu Thr Asp 
380 

etc aga ata 
Leu Arg He 
395 

ctt ttt gtc 
Leu Phe Val 
410 

cac get atg 
His Ala Met 



gtt tac ttg 
Val Tyr Leu 



tat cag aag 
Tyr Gin Lys 
460 

atg ggg ttc 
Met Gly Phe 
475 

acc tac ttg 
Thr Tyr Leu 

490 ' 

aac att cga 
Asn He Arg 



tgg atg gcg 
Trp Met Ala 



tat ttt gga 



ttt aaa gac 
Phe Lys Asp 



ctt atg tat 
Leu Met Tyr 



tct eta gac 
Ser Leu Asp 
335 

ttt caa ccc 
Phe Gin Pro 

350 

tac tgc tec 

Tyr Cys Ser 
365 

ggc att tct 

Gly He Ser 



ttt gtc tgg 
Phe Val Trp 



att ggc atg 
He Gly Met 
415 

tec ate aaa 
Ser He Lys 
430 

ttc ttt gtt 
Phe Phe Val 
445 

tat gec ttg 
Tyr Ala Leu 



ctg gec atg 
Leu Ala Met 



act ttg gag 
Thr Leu Glu 
495 

cct gga aaa 
Pro Gly Lys 
510 

gga ttt tta 
Gly Phe Leu 
525 

aac ttt tat 



ttg 912 
Leu 



ctt 960 

Leu 

320 

ctg 1008 
Leu 



atg 1056 
Met 



tat 1104 
Tyr 



tea 1152 
Ser 



gtt 1200 

Val 

400 

aga 1248 
Arg 



ate 1296 
He 



ggc 1344 
Gly 



ctg 1392 
Leu 



ctg 1440 

Leu 

480 

aag 1488 
Lys 



egg 1536 
Arg 



ata 1584 
lie 



ggg 1632 
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Ala Val lie Pro Phe Trp Asn Lys Asp Tyr Phe Gly Asn Phe Tyr Gly 
530 535 540 

aaa aat gga gta tgt ttc cca ctt tat tat gac caa aca gaa gat att 1680 
Lys Asn Gly Val Cys Phe Pro Leu Tyr Tyr Asp Gin Thr Glu Asp lie 
545 550 555 560 

gga age aaa ggg tat tct ctt gga att ttc eta ggt gtg aac ttg ctg 1728 
Gly Ser Lys Gly Tyr Ser Leu Gly lie Phe Leu Gly Val Asn Leu Leu 
565 570 575 

get ttt etc ate att gtg ttt tec tat att act atg ttc tgt tec att 1776 
Ala Phe Leu lie lie Val Phe Ser Tyr lie Thr Met Phe Cys Ser lie 
580 585 590 

caa aaa acc gec ttg cag ace aca gaa gta agg aat tgt ttt gga aga 1824 
Gin Lys Thr Ala Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg 
595 600 605 

gag gtg get gtt gca aat cgt ttc ttt ttt ata gtg ttc tct gat gee 1872 
Glu Val Ala Val Ala Asn Arg Phe Phe Phe lie Val Phe Ser Asp Ala 
610 615 620 

ate tgc tgg att cct gta ttt gta gtt aaa ate ctt tec etc ttc egg 1920 
He Cys Trp He Pro Val Phe Val Val Lys He Leu Ser Leu Phe Arg 
625 630 635 640 

gtg gaa ata cca gac aca atg act tec tgg ata gtg att ttt ttc ctt 1968 
Val Glu He Pro Asp Thr Met Thr Ser Trp He Val He Phe Phe Leu 
645 650 655 

cca gtt aac agt get ttg aat cca ate etc tat act etc aca acc aac 2016 
Pro Val Asn Ser Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn 
660 665 670 

ttt ttt aag gac aag ttg aaa cag ctg ctg cac aaa cat cag agg aaa 2064 
Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys 
675 680 685 

tea att ttc aaa att aaa aaa aaa agt tta tct aca tec att gtg tgg 2112 
Ser He Phe Lys He Lys Lys Lys Ser Leu Ser Thr Ser He Val Trp 
690 695 700 

ata gag gac tec tct tec ctg aaa ctt ggg gtt ttg aac aaa ata aca 2160 
He Glu Asp Ser Ser Ser Leu Lys Leu Gly Val Leu Asn Lys He Thr 
705 710 715 720 

ctt gga gac agt ata atg aaa cca gtt tec tag 2193 
Leu Gly Asp Ser He Met Lys Pro Val Ser 
725 730 



<210> 7 
<211> 730 
<212> PRT 

<2 13> Homo sapiens 
<400>. 7 

Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 
15 10 15 



15 



WO 02/14489 



PCT/LSOl/25054 



Thr Met Phe Phe 
20 

Phe Ala Leu Thr 
35 

Tyr Phe Pro Cys 

50 

Cys Asp Gly Lys 
65 

Gly Asp Thr Ser 



Ala Asn Ser Val 
100 

Gin Cys Cys Asp 
115 

Leu Lys Ser Val 
130 

Lys Lys Asn Lys 
145 

Thr Lys Leu Lys 



Ser Arg Lys Ala 
180 

Asn His Asn Cys 
195 

His Gin Leu Thr 
210 

Ser Gin Arg Leu 

225 

Val Asn Asn Tyr 



Pro Gin Leu Asn 
260 

Thr Asn Ser Thr 

275 

Ser Ser Asn Thr 

290 

Lys Leu Leu Gin 
305 

His Lys Asn Gin 



Glu Arg lie Glu 



Leu Leu His Phe 



Gin Gly Ser Met 

40 

Gly Asn Leu Thr 
55 

Asp Asp Cys Gly 
70 

Gly Trp Ala Thr 
85 

Ala Leu Thr Gin 



Cys Lys Glu Thr 
120 

Pro Met lie Ser 
135 

lie His Ser Leu 
150 

Lys lie Phe Leu 
165 

Phe Phe Gly Leu 



lie Thr Thr Leu 
200 

Trp Leu lie Leu 
215 

Phe Thr Gly Leu 
230 

Leu Glu Ala Leu 
245 

Trp Val Asp Leu 



Phe Leu Ser Cys 

280 

lie Thr Glu Leu 
295 

Lys Leu Asn Leu 
310 

Phe Glu Ser Leu 
325 

lie Pro Asn lie 



lie Val Leu lie 

25 

He Thr Pro Ser 



Lys Cys Leu Pro 
60 

Asn Gly Ala Asp 
75 

He Phe Gly Thr 

90 

Glu Cys Phe Leu 

105 

Glu Leu Glu Cys 



Asn Asn Val Thr 
140 

Pro Asp Lys Val 
155 

Gin His Asn Cys 

170 

Cys Asn Leu Gin 
185 

Arg Pro Gly He 



Asp Asp Asn Pro 
220 

Asn Ser Leu Phe 

235 

Pro Lys Gin Met 
250 

Glu Gly Asn Arg 
265 

Asp Ser Leu Thr 



Ser Pro His Leu 
300 

Ser Ser Asn Pro 
315 

Lys Gin Leu Gin 
330 

Asn Thr Arg Met 



Asn Val Lys Asp 
30 

Cys Gin Lys Gly 
45 

Arg Ala Phe His 



Glu Glu Asn Cys 

80 

Val His Gly Asn 

95 

Lys Gin Tyr Pro 
110 

Val Asn Gly Asp 
125 

Leu Leu Ser Leu 



Phe He Lys Tyr 

160 

He Arg His He 
175 

He Leu Tyr Leu 
190 

Phe Lys Asp Leu 
205 

He Thr Arg He 



Phe Leu Ser Met 

240 

Cys Ala Gin Met 
255 

He Lys Tyr Leu 
270 

Val Leu Asp Leu 
285 

Phe Lys Asp Leu 



Leu Met Tyr Leu 
320 

Ser Leu Asp Leu 

335 

Phe Gin Pro Met 
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340 

Lys Asn Leu Ser 
355 

Ala Pro His Val 
370 

Phe Glu Asp Leu 

385 

He Ala Phe He 



Ser Phe He Lys 
420 

Leu Cys Cys Ala 
435 

He Phe Asp He 

450 

Trp Met Glu Ser 
4 65 

Ser Thr Glu Val 



Phe Leu Val He 
500 

Gin Thr Ser Val 

515 

Ala Val He Pro 
530 

Lys Asn Gly Val 
545 

Gly Ser Lys Gly 



Ala Phe Leu lie 
580 

Gin Lys Thr Ala 
595 

Glu Val Ala Val 
610 

He Cys Trp He 
625 

Val Glu He Pro 



Pro Val Asn Ser 
660 



His He Tyr Phe 
360 

Arg He Cys Met 
375 

Leu Ala Asn Asn 
390 

Thr Cys Phe Gly 
405 

Ala Glu Asn Thr 



Asp Cys Leu Met 
440 

Lys Tyr Arg Gly 

455 

Val Gin Cys Arg 

470 

Ser Val Leu Leu 
485 

Val Phe Pro Phe 



He Leu lie Cys 

520 

Phe Trp Asn Lys 
535 

Cys Phe Pro Leu 
550 

Tyr Ser Leu Gly 
565 

He Val Phe Ser 



Leu Gin Thr Thr 
600 

Ala Asn Arg Phe 
615 

Pro Val Phe Val 
630 

Asp Thr Met Thr 
645 

Ala Leu Asn Pro 



345 

Lys Asn Phe Arg 



Pro Leu Thr Asp 
380 

lie Leu Arg lie 
395 

Asn Leu Phe Val 
410 

Thr His Ala Met 

425 

Gly Val Tyr Leu 



Gin Tyr Gin Lys 
460 

Leu Met Gly Phe 
475 

Leu Thr Tyr Leu 

490 

Ser Asn He Arg 

505 

He Trp Met Ala 



Asp Tyr Phe Gly 
540 

Tyr Tyr Asp Gin 
555 

lie Phe Leu Gly 
570 

Tyr lie Thr Met 
585 

Glu Val Arg Asn 



Phe Phe lie Val 
620 

Val Lys lie Leu 
635 

Ser Trp He Val 
650 

He Leu Tyr Thr 
665 



Tyr Cys Ser Tyr 
365 

Gly lie Ser Ser 



Phe Val Trp Val 
400 

He Gly Met Arg 
415 

Ser He Lys He 
430 

Phe Phe Val Gly 
445 

Tyr Ala Leu Leu 



Leu Ala Met Leu 

480 

Thr Leu Glu Lys 
495 

Pro Gly Lys Arg 
510 

Gly Phe Leu lie 
525 

Asn Phe Tyr Gly 



Thr Glu Asp He 
560 

Val Asn Leu Leu 
575 

Phe Cys Ser lie 
590 

Cys Phe Gly Arg 
605 

Phe Ser Asp Ala 



Ser Leu Phe Arg 
640 

He Phe Phe Leu 
655 

Leu Thr Thr Asn 
670 
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Phe Phe Lys Asp 
675 

Ser lie Phe Lys 

690 

lie Glu Asp Ser 
705 

Leu Gly Asp Ser 



Lys Lea Lys Gin 
680 

lie Lys Lys Lys 
695 

Ser Ser Leu Lys 
710 

lie Met Lys Pro 
725 



Leu Leu His Lys 



Ser Leu Ser Thr 

700 

Leu Gly Val Leu 
715 

Val Ser 
730 



His Gin Arg Lys 

685 

Ser lie Val Trp 



Asn Lys lie Thr 
720 



<210> 8 
<211> 694 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 
115 120 125 

Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

He Thr Thr Leu Arg Pro Gly He Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

Trp Leu He Leu Asp Asp Asn Pro lie Thr Arg He Ser Gin Arg Leu 
180 185 190 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
195 200 205 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 

210 215 220 
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Trp Val Asp Leu 

225 

Fhe Leu Ser Cys 



lie Thr Glu Leu 
260 

Lys Leu Asn Leu 

275 

Phe Glu Ser Leu 
290 

lie Pro Asn lie 

305 

His He Tyr Phe 



Arg He Cys Met 
340 

Leu Ala Asn Asn 
355 

Thr Cys Phe Gly 
370 

Ala Glu Asn Thr 
385 

Asp Cys Leu Met 



Lys Tyr Arg Gly 
420 

Val Gin Cys Arg 
435 

Ser Val Leu Leu 
450 

Val Phe Pro Phe 
465 

lie Leu He Cys 



Phe Trp Asn Lys 
500 

Cys Phe Pro Leu 
515 

Tyr Ser Leu Gly 

530 



Glu Gly Asn Arg 
230 

Asp Ser Leu Thr 

245 

Ser Pro His Leu 



Ser Ser Asn Pro 
280 

Lys Gin Leu Gin 
295 

Asn Thr Arg Met 
310 

Lys Asn Phe Arg 
325 

Pro Leu Thr Asp 



He Leu Arg He 
360 

Asn Leu Phe Val 
375 

Thr His Ala Met 
390 

Gly Val Tyr Leu 
405 

Gin Tyr Gin Lys 



Leu Met Gly Phe 
440 

Leu Thr Tyr Leu 
455 

Ser Asn He Arg 
470 

He Trp Met Ala 
485 

Asp Tyr Phe Gly 



Tyr Tyr Asp Gin 
520 

He Phe Leu Gly 

535 



lie Lys Tyr Leu 
235 

Val Leu Asp Leu 
250 

Phe Lys Asp Leu 
265 

Leu Met Tyr Leu 



Ser Leu Asp Leu 
300 

Phe Gin Pro Met 
315 

Tyr Cys Ser Tyr 
330 

Gly He Ser Ser 
345 

Phe Val Trp Val 



He Gly Met Arg 

380 

Ser He Lys He 
395 

Phe Phe Val Gly 
410 

Tyr Ala Leu Leu 
425 

Leu Ala Met Leu 



Thr Leu Glu Lys 
460 

Pro Gly Lys Arg 
475 

Gly Phe Leu He 
490 

Asn Phe Tyr Gly 

505 

Thr Glu Asp He 



Val Asn Leu Leu 
540 



Thr Asn Ser Thr 
240 

Ser Ser Asn Thr 
255 

Lys Leu Leu Gin 
270 

His Lys Asn Gin 
285 

Glu Arg He Glu 



Lys Asn Leu Ser 
320 

Ala Pro His Val 
335 

Phe Glu Asp Leu 
350 

He Ala Phe He 
365 

Ser Phe He Lys 



Leu Cys Cys Ala 
400 

lie Phe Asp He 
415 

Trp Met Glu Ser 
430 

Ser Thr Glu Val 
445 

Phe Leu Val He 



Gin Thr Ser Val 
480 

Ala Val He Pro 
495 

Lys Asn Gly Val 
510 

Gly Ser Lys Gly 
525 

Ala Phe Leu He 
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He Val Phe Ser Tyr lie Thr Met Phe Cys Ser lie Gin Lys Thr Ala 
545 550 555 560 

Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val 
565 570 575 

Ala Asn Arg Phe Phe Phe He Val Phe Ser Asp Ala He Cys Trp lie 
580 585 590 

Pro Val Phe Val Val Lys He Leu Ser Leu Phe Arg Val Glu He Pro 
595 600 605 

Asp Thr Met Thr Ser Trp He Val He Phe Phe Leu Pro Val Asn Ser 
610 615 620 

Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp 
625 630 635 640 

Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys Ser He Phe Lys 
645 650 655 

lie Lys Lys Lys Ser Leu Ser Thr Ser He Val Trp He Glu Asp Ser 
660 665 670 

Ser Ser Leu Lys Leu Gly Val Leu Asn Lys He Thr Leu Gly Asp Ser 
675 680 685 

He Met Lys Pro Val Ser 

690 



<210> 9 
<211> 1077 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1077) 

<400> 9 

caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 
Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

ggg aat ctt acc aag tgc tta ccc cga get ttt cac tgt gat ggc aag 
Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 



48 



96 



gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 144 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 

35 40 45 

gga tgg gcg acc ata ttt ggc aca gtg cat gga aat get aac age gtg 192 

Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 

50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 

65 70 75 80 
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tta gaa get ctt ccc aag cag atg tgt gec caa atg cct caa etc aac 
Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 



ttt ctg teg tgc gat teg etc aca gtg ctg gat ctg tct age aat acg 
Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu Ser Ser Asn Thr 
245 250 255 



336 



384 



tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 

Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 

lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 
115 120 125 

aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 432 

Lys lie Phe Leu Gin His Asn Cys lie Arg His lie Ser Arg Lys Ala 

130 135 140 

ttt ttt gga tta tgt aat ctg caa ata tta tat etc aac cac aac tgc 480 

Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu Asn His Asn Cys 

145 150 155 160 

ate aca ace etc aga cct gga ata ttc aaa gac tta cat cag eta act 528 

lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

tgg eta att eta gat gac aat cca ata acc aga att tea cag cgc ttg 5 7 6 

Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Arg Leu 
180 185 190 

ttt acg gga tta aat tec ttg ttt ttc ctg tct atg gtt aat aac tac 624 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
195 200 205 



672 



tgg gtg gat ttg gaa ggc aat aga ata aag tat etc aca aat tct acg 720 
Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 
225 230 235 240 



768 



ata acg gag eta tea cct cac ctt ttt aaa gac ttg aag ctt eta caa 816 

lie Thr Glu Leu Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin 
260 265 270 

aag ctg aac ctg tea tec aat cct ctt atg tat ctt cac aag aac cag 864 

Lys Leu Asn Leu Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin 

275 280 285 

ttt gaa agt ctt aaa caa ctt cag tcr eta gac ctg gaa agg ata gag 912 

Phe Glu Ser Leu Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg lie Glu 
290 295 300 

att cca aat ata aac aca cga atg ttt caa ccc atg aag aat ctt tct 960 

lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 

305 310 315 320 

cac att tat ttc aaa aac ttt cga tac tgc tec tat get ccc cat gtc 1008 
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His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val 

325 330 335 

cga ata tgt atg ccc ttg acg gac ggc att tct tea ttt gag gac etc 1056 

Arg lie Cys Met Pro Leu Thr Asp Gly He Ser Ser Phe Glu Asp Leu 
340 345 350 

ttg get aac aat ate etc aga 1077 

Leu Ala Asn Asn He Leu Arg 
355 



<210> 10 
<211> 359 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 
115 120 125 

Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

He Thr Thr Leu Arg Pro Gly He Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He Ser Gin Arg Leu 
180 185 190 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 
195 200 205 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 
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225 

Phe Leu Ser Cys 



He Thr Glu Leu 
260 

Lys Leu Asn Leu 
275 

Phe Glu Ser Leu 
290 

He Pro Asn He 

305 

His He Tyr Phe 



Arg He Cys Met 
340 

Leu Ala Asn Asn 
355 



230 

Asp Ser Leu Thr 
245 

Ser Pro His Leu 



Ser Ser Asn Pro 

280 

Lys Gin Leu Gin 
295 

Asn Thr Arg Met 
310 

Lys Asn Phe Arg 
325 

Pro Leu Thr Asp 



He Leu Arg 



235 

Val Leu Asp Leu 

250 

Phe Lys Asp Leu 
265 

Leu Met Tyr Leu 



Ser Leu Asp Leu 
300 

Phe Gin Pro Met 
315 

Tyr Cys Ser Tyr 
330 

Gly He Ser Ser 
345 



Ser Ser Asn Thr 
255 

Lys Leu Leu Gin 

270 

His Lys Asn Gin 
285 

Glu Arg He Glu 



Lys Asn Leu Ser 

320 

Ala Pro His Val 
335 

Phe Glu Asp Leu 
350 



<210> 11 

<211> 2049 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) - . (2046) 

<220> 

< 2 2 1 > sig__peptide 
<222> (1) . . (108) 

<400> 11 

atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 

Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 

15 10 15 



ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 

Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 

35 40 45 

tat ttt ccc tgt ggg aat ctt acc aag tgc tta ccc cga get ttt cac 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 

50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 

65 70 75 80 



48 



aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 
Thr Met Phe Phe Leu Leu His Phe He Val Leu He Asn Val Lys Asp 
20 25 30 



144 



192 



240 
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ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 
Gly Asp Thr Ser Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn 
85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 
Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 
Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 
Leu Lys Ser Val Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 
Lys Lys Asn Lys He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr 
145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 
Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta att eta 576 
Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin He Leu He Leu 
180 185 190 

gat gac aat cca ata acc aga att tea cag cgc ttg ttt acg gga tta 624 
Asp Asp Asn Pro He Thr Arg He Ser Gin Arg Leu Phe Thr Gly Leu 
195 200 205 

aat tec ttg ttt ttc ctg tct atg gtt aat aac tac tta gaa get ctt 672 
Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr Leu Glu Ala Leu 
210 215 220 

ccc aag cag atg tgt gec caa atg cct caa etc aac tgg gtg gat ttg 720 
Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn Trp Val Asp Leu 
225 230 235 240 

gaa ggc aat aga ata aag tat etc aca aat tct acg ttt ctg teg tgc 768 
Glu Gly Asn Arg He Lys Tyr Leu Thr Asn Ser Thr Phe Leu Ser Cys 
245 250 255 

gat teg etc aca gtg ctg gat ctg tct age aat acg ata acg gag eta 816 
Asp Ser Leu Thr Val Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu 
260 265 270 

tea cct cac ctt ttt aaa gac ttg aag ctt eta caa aag eta gac ctg 864 
Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asp Leu 
275 280 285 

gaa agg ata gag att cca aat ata aac aca cga atg ttt caa ccc atg 912 
Glu Arg He Glu He Pro Asn He Asn Thr Arg Met Phe Gin Pro Met 
290 295 300 

aag aat ctt tct cac att tat ttc aaa aac ttt cga tac tgc tec tat 960 
Lys Asn Leu Ser His He Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr 
305 310 315 320 

get ccc cat gtc cga ata tgt atg ccc ttg acg gac ggc att tct tea 1008 
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Ala Pro His Val Arg He Cys Met Pro Leu Thr Asp Gly He Ser Ser 

325 " 330 335 

ttt gag gac etc ttg get aac aat ate etc aga ata ttt gtc tgg gtt 1056 

Phe Glu Asp Leu Leu Ala Asn Asn He Leu Arg He Phe Val Trp Val 

340 345 350 

ata get ttc att ace tgc ttt gga aat ctt ttt gtc att ggc atg aga 1104 

He Ala Phe He Thr Cys Phe Gly Asn Leu Phe Val He Gly Met Arg 

355 360 365 

tct ttc att aaa get gaa aat aca act cac get atg tec ate aaa ate 1152 

Ser Phe He Lys Ala Glu Asn Thr Thr His Ala Met Ser He Lys He 

370 375 380 

ctt tgt tgt get gat tgc ctg atg ggt gtt tac ttg ttc ttt gtt ggc 1200 

Leu Cys Cys Ala Asp Cys Leu Met Gly Val Tyr Leu Phe Phe Val Gly 

385 390 395 400 

att ttc gat ata aaa tac cga ggg cag tat cag aag tat gec ttg ctg 1248 

He Phe Asp He Lys Tyr Arg Gly Gin Tyr Gin Lys Tyr Ala Leu Leu 

405 410 415 

tgg atg gag age gtg cag tgc cgc etc atg ggg ttc ctg gee atg ctg 1296 

Trp Met Glu Ser Val Gin Cys Arg Leu Met Gly Phe Leu Ala Met Leu 

420 425 430 

tec acc gaa gtc tct gtt ctg eta ctg acc tac ttg act ttg gag aag 1344 

Ser Thr Glu Val Ser Val Leu Leu Leu Thr Tyr Leu Thr Leu Glu Lys 

435 440 445 

ttc ctg gtc att gtc ttc ccc ttc agt aac att cga cct gga aaa egg 1392 

Phe Leu Val He Val Phe Pro Phe Ser Asn He Arg Pro Gly Lys Arg 

450 455 460 

cag acc tea gtc ate etc att tgc ate tgg atg gcg gga ttt tta ata 1440 

Gin Thr Ser Val He Leu He Cys He Trp Met Ala Gly Phe Leu He 

465 470 475 480 

get gta att cca ttt tgg aat aag gat tat ttt gga aac ttt tat ggg 1488 

Ala Val He Pro Phe Trp Asn Lys Asp Tyr Phe Gly Asn Phe Tyr Gly 

485 490 495 

aaa aat gga gta tgt ttc cca ctt tat tat gac caa aca gaa gat att 1536 

Lys Asn Gly Val Cys Phe Pro Leu Tyr Tyr Asp Gin Thr Glu Asp He 

500 505 510 

gga age aaa ggg tat tct ctt gga att ttc eta ggt gtg aac ttg ctg 1584 

Gly Ser Lys Gly Tyr Ser Leu Gly He Phe Leu Gly Val Asn Leu Leu 

515 520 525 

get ttt etc ate att gtg ttt tec tat att act atg ttc tgt tec att 1632 

Ala Phe Leu He He Val Phe Ser Tyr He Thr Met Phe Cys Ser He 

530 535 540 

caa aaa acc gec ttg cag acc aca gaa gta agg aat tgt ttt gga aga 1680 

Gin Lys Thr Ala Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg 

545 550 555 560 

gag gtg get gtt gca aat cgt ttc ttt ttt ata gtg ttc tct gat gec 1728 

Glu Val Ala Val Ala Asn Arg Phe Phe Phe He Val Phe Ser Asp Ala 
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565 570 575 

ate tgc tgg att cct gta ttt gta gtt aaa ate ctt tec etc ttc egg 1776 

lie Cys Trp lie Pro Val Phe Val Val Lys lie Leu Ser Leu Phe Arg 
580 585 590 

gtg gaa at a cca gac aca atg act tec tgg at a gtg att ttt ttc ctt 1824 

Val Glu lie Pro Asp Thr Met Thr Ser Trp lie Val lie Phe Phe Leu 
595 600 605 

cca gtt aac agt get ttg aat cca ate etc tat act etc aca acc aac 1872 

Pro Val Asn Ser Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn 
610 615 620 

ttt ttt aag gac aag ttg aaa cag ctg ctg cac aaa cat cag agg aaa 1920 

Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys 

625 630 635 640 

tea att ttc aaa att aaa aaa aaa agt tta tct aca tec att gtg tgg 1968 

Ser He Phe Lys lie Lys Lys Lys Ser Leu Ser Thr Ser He Val Trp 
645 650 655 

ata gag gac tec tct tec ctg aaa ctt ggg gtt ttg aac aaa ata aca 2016 

He Glu Asp Ser Ser Ser Leu Lys Leu Gly Val Leu Asn Lys He Thr 
660 665 670 

ctt gga gac agt ata atg aaa cca gtttcctag 2049 

Leu Gly Asp Ser He Met Lys Pro Val Ser 
675 680 



<210> 12 
<211> 682 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 
15 10 15 

Thr Met Phe Phe Leu Leu His Phe lie Val Leu He Asn Val Lys Asp 
20 25 30 

Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 
35 40 45 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 
50 55 60 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 
65 70 75 80 

Gly Asp Thr Ser Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn 
85 90 95 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 
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Leu Lys Ser Val 
130 

Lys Lys Asn Lys 
145 

Thr Lys Leu Lys 



Ser Arg Lys Ala 
180 

Asp Asp Asn Pro 

195 

Asn Ser Leu Phe 
210 

Pro Lys Gin Met 
225 

Glu Gly Asn Arg 



Asp Ser Leu Thr 
260 

Ser Pro His Leu 

275 

Glu Arg lie Glu 

290 

Lys Asn Leu Ser 

305 

Ala Pro His Val 



Phe Glu Asp Leu 
340 

lie Ala Phe lie 
355 

Ser Phe lie Lys 
370 

Leu Cys Cys Ala 
385 

lie Phe Asp lie 



Trp Met Glu Ser 
420 

Ser Thr Glu Val 
435 

Phe Leu Val lie 



Pro Met lie Ser 
135 

lie His Ser Leu 
150 

Lys lie Phe Leu 
165 

Phe Phe Gly Leu 



lie Thr Arg lie 
200 

Phe Leu Ser Met 
215 

Cys Ala Gin Met 
230 

lie Lys Tyr Leu 
245 

Val Leu Asp Leu 



Phe Lys Asp Leu 

280 

lie Pro Asn lie 
295 

His lie Tyr Phe 

310 

Arg lie Cys Met 
325 

Leu Ala Asn Asn 



Thr Cys Phe Gly 
360 

Ala Glu Asn Thr 
375 

Asp Cys Leu Met 
390 

Lys Tyr Arg Gly 

405 

Val Gin Cys Arg 



Ser Val Leu Leu 
440 

Val Phe Pro Phe 



Asn Asn Val Thr 
140 

Pro Asp Lys Val 
155 

Gin His Asn Cys 
170 

Cys Asn Leu Gin 
185 

Ser Gin Arg Leu 



Val Asn Asn Tyr 

220 

Pro Gin Leu Asn 
235 

Thr Asn Ser Thr 
250 

Ser Ser Asn Thr 
265 

Lys Leu Leu Gin 



Asn Thr Arg Met 
300 

Lys Asn Phe Arg' 
315 

Pro Leu Thr Asp 

330 

lie Leu Arg lie 
345 

Asn Leu Phe Val 



Thr His Ala Met 
380 

Gly Val Tyr Leu 
395 

Gin Tyr Gin Lys 
410 

Leu Met Gly Phe 
425 

Leu Thr Tyr Leu 



Ser Asn lie Arg 



Leu Leu Ser Leu 



Phe lie Lys Tyr 

160 

lie Arg His lie 

175 

lie Leu lie Leu 
190 

Phe Thr Gly Leu 
205 

Leu Glu Ala Leu 



Trp Val Asp Leu 
240 

Phe Leu Ser Cys 
255 

lie Thr Glu Leu 
270 

Lys Leu Asp Leu 

285 

Phe Gin Pro Met 



Tyr Cys Ser Tyr 

320 

Gly lie Ser Ser 
335 

Phe Val Trp Val 
350 

lie Gly Met Arg 
365 

Ser lie Lys He 



Phe Phe Val Gly 
400 

Tyr Ala Leu Leu 
415 

Leu Ala Met Leu 
430 

Thr Leu Glu Lys 
445 

Pro Gly Lys Arg 
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450 

Gin Thr Ser Val lie 
465 

Ala Vai He Pro Phe 
485 

Lys Asn Gly Val Cys 
500 

Gly Ser Lys Gly Tyr 
515 

Ala Phe Leu He He 

530 

Gin Lys Thr Ala Leu 

545 

Glu Val Ala Val Ala 
565 

He Cys Trp He Pro 

580 

Val Glu He Pro Asp 
595 

Pro Val Asn Ser Ala 
610 

Phe Phe Lys Asp Lys 
625 

Ser He Phe Lys He 
645 

He Glu Asp Ser Ser 
660 

Leu Gly Asp Ser He 
675 



455 

Leu lie Cys He Trp Met 
470 475 

Trp Asn Lys Asp Tyr Phe 
490 

Phe Pro Leu Tyr Tyr Asp 
505 

Ser Leu Gly He Phe Leu 
520 

Val Phe Ser Tyr He Thr 
535 

Gin Thr Thr Glu Val Arg 
550 555 

Asn Arg Phe Phe Phe He 
570 

Val Phe Vai Val Lys lie 
585 

Thr Met Thr Ser Trp He 
600 

Leu Asn Pro He Leu Tyr 
615 

Leu Lys Gin Leu Leu His 

630 635 

Lys Lys Lys Ser Leu Ser 
650 

Ser Leu Lys Leu Gly Val 
665 

Met Lys Pro Val Ser 
680 



460 

Ala Gly Phe Leu He 
480 

Gly Asn Phe Tyr Gly 
495 

Gin Thr Glu Asp He 

510 

Gly Val Asn Leu Leu 
525 

Met Phe Cys Ser He 
540 

Asn Cys Phe Gly Arg 
560 

Val Phe Ser Asp Ala 
575 

Leu Ser Leu Phe Arg 
590 

Val He Phe Phe Leu 
605 

Thr Leu Thr Thr Asn 
620 

Lys His Gin Arg Lys 
640 

Thr Ser He Val Trp 
655 

Leu Asn Lys lie Thr 
670 



<210> 13 
<211> 646 
<212> PRT 

<213> Homo sapiens 
<400> 13 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 

20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
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50 



55 



60 



Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 



Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 
115 120 125 

Lys lie Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu He Leu Asp Asp Asn Pro 
145 150 155 160 

He Thr Arg He Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe 
165 170 175 

Phe Leu Ser Met Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met 
180 185 190 

Cys Ala Gin Met Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg 
195 200 205 

He Lys Tyr Leu Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr 
210 215 220 

Val Leu Asp Leu Ser Ser Asn Thr He Thr Glu Leu Ser Pro His Leu 
225 230 235 240 

Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asp Leu Glu Arg He Glu 
245 250 255 

He Pro Asn He Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 
260 265 270 

His He Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val 
275 280 285 

Arg He Cys Met Pro Leu Thr Asp Gly He Ser Ser Phe Glu Asp Leu 
290 295 300 

Leu Ala Asn Asn He Leu Arg lie Phe Val Trp Val lie Ala Phe He 
305 310 315 320 

Thr Cys Phe Gly Asn Leu Phe Val He Gly Met Arg Ser Phe He Lys 
325 330 335 

Ala Glu Asn Thr Thr His Ala Met Ser He Lys He Leu Cys Cys Ala 
340 345 350 

Asp Cys Leu Met Gly Val Tyr Leu Phe Phe Val Gly He Phe Asp He 
355 360 365 

Lys Tyr Arg Gly Gin Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser 
370 375 380 



85 



90 



95 
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Val Gin Cys Arg Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val 
385 390 395 400 

Ser Val Leu Leu Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val lie 
405 410 415 

Val Phe Pro Phe Ser Asn lie Arg Pro Gly Lys Arg Gin Thr Ser Val 
420 425 430 

lie Leu lie Cys lie Trp Met Ala Gly Phe Leu lie Ala Val lie Pro 
435 440 445 

Phe Trp Asn Lys Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val 
450 455 460 

Cys Phe Pro Leu Tyr Tyr Asp Gin Thr Glu Asp lie Gly Ser Lys Gly 
465 470 475 480 

Tyr Ser Leu Gly He Phe Leu Gly Val Asn Leu Leu Ala Phe Leu He 
485 490 495 

He Val Phe Ser Tyr He Thr Met Phe Cys Ser He Gin Lys Thr Ala 
500 505 510 

Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val 
515 520 525 

Ala Asn Arg Phe Phe Phe He Val Phe Ser Asp Ala He Cys Trp He 
530 535 540 

Pro Val Phe Val Val Lys lie Leu Ser Leu Phe Arg Val Glu He Pro 
545 550 555 560 

Asp Thr Met Thr Ser Trp He Val He Phe Phe Leu Pro Val Asn Ser 
565 570 575 

Ala Leu Asn Pro He Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp 
580 585 590 

Lys Leu Lys Gin Leu Leu His Lys His Gin Arg Lys Ser He Phe Lys 
595 600 605 

He Lys Lys Lys Ser Leu Ser Thr Ser He Val Trp He Glu Asp Ser 
610 615 620 

Ser Ser Leu Lys Leu Gly Val Leu Asn Lys He Thr Leu Gly Asp Ser 
625 630 635 640 

He Met Lys Pro Val Ser 
645 



<210> 14 
<211> 933 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) - - (933) 
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<400> 14 

caa ggt age atg ate act cct tea tgc caa aaa gga tat ttt ccc tgt 48 
Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

ggg aat ctt acc aag tgc tta ccc cga get ttt cac tgt gat ggc aag 96 
Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 
20 25 30 

gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt ggt gac act agt 14 4 
Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

gga tgg gcg acc ata ttt ggc aca gtg cat gga aat get aac age gtg 192 
Gly Trp Ala Thr He Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 

50 55 60 

gec tta aca cag gag tgc ttt eta aaa cag tat cca caa tgc tgt gac 240 
Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac tta aag tct gtg 288 
Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

ccg atg att tct aac aat gtg aca tta ctg tct ctt aag aaa aac aaa 336 
Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

ate cac agt ctt cca gat aaa gtt ttc ate aaa tac aca aaa ctt aaa 384 
He His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr Thr Lys Leu Lys 
115 120 125 

aag ata ttt ctt cag cat aat tgc att aga cac ata tec agg aaa gca 432 
Lys He Phe Leu Gin His Asn Cys He Arg His He Ser Arg Lys Ala 
130 135 140 

ttt ttt gga tta tgt aat ctg caa ata tta att eta gat gac aat cca 480 
Phe Phe Gly Leu Cys Asn Leu Gin He Leu He Leu Asp Asp Asn Pro 
145 150 155 160 

ata acc aga att tea cag cgc ttg ttt acg gga tta aat tec ttg ttt 528 
He Thr Arg He Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe 
165 170 175 

ttc ctg tct atg gtt aat aac tac tta gaa get ctt ccc aag cag atg 576 
Phe Leu Ser Met Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met 
180 185 190 

tgt gee caa atg cct caa etc aac tgg gtg gat ttg gaa ggc aat aga 624 
Cys Ala Gin Met Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg 
195 200 205 

ata aag tat etc aca aat tct acg ttt ctg teg tgc gat teg etc aca 672 
He Lys Tyr Leu Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr 
210 215 220 

gtg ctg gat ctg tct age aat acg ata acg gag eta tea cct cac ctt 720 
Val Leu Asp Leu Ser Ser Asn Thr lie Thr Glu Leu Ser Pro His Leu 
225 230 235 240 
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ttt aaa gac ttg aag ctt eta caa aag eta gac ctg gaa agg ata gag 
Phe Lys Asp Leu Lys Leu Leu Gin Lys Leu Asp Leu Glu Arg lie Glu 
245 250 255 



768 



att cca aat ata aac aca cga atg ttt caa ccc atg aag aat ctt tct 
lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 
260 265 270 



816 



cac att tat ttc aaa aac ttt cga tac tgc tec tat get ccc cat gtc 
His lie Tyr Phe Lys Asn Phe Arg Tyr Cys Ser Tyr Ala Pro His Val 
275 280 285 



864 



cga ata tgt atg ccc ttg acg gac ggc att tct tea ttt gag gac etc 
Arg lie Cys Met Pro Leu Thr Asp Gly lie Ser Ser Phe Glu Asp Leu 
290 295 300 



912 



ttg get aac aat ate etc aga 
Leu Ala Asn Asn lie Leu Arg 
305 310 



933 



<210> 15 
<211> 311 
<212> PRT 

<213> Homo sapiens 
<400> 15 

Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Lys 

20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn Ala Asn Ser Val 
50 55 60 

Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro Gin Cys Cys Asp 
65 70 75 80 

Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp Leu Lys Ser Val 
85 90 95 

Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr Thr Lys Leu Lys 
115 120 125 

Lys lie Phe Leu Gin His Asn Cys lie Arg His He Ser Arg Lys Ala 
130 135 140 

Phe Phe Gly Leu Cys Asn Leu Gin He Leu He Leu Asp Asp Asn Pro 
145 150 155 160 

He Thr Arg He Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe 
165 170 175 
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Phe Leu Ser Met 
180 

Cys Ala Gin Met 
195 

lie Lys Tyr Leu 
210 

Val Leu Asp Leu 
225 

Phe Lys Asp Leu 



lie Pro Asn lie 
260 

His lie Tyr Phe 
275 

Arg He Cys Met 
290 

Leu Ala Asn Asn 
305 



Val Asn Asn Tyr 



Pro Gin Leu Asn 

200 

Thr Asn Ser Thr 
215 

Ser Ser Asn Thr 
230 

Lys Leu Leu Gin 
245 

Asn Thr Arg Met 



Lys Asn Phe Arg 
280 

Pro Leu Thr Asp 
295 

He Leu Arg 
310 



Leu Glu Ala Leu 

185 

Trp Val Asp Leu 



Phe Leu Ser Cys 

220 

He Thr Glu Leu 
235 

Lys Leu Asp Leu 
250 

Phe Gin Pro Met 
265 

Tyr Cys Ser Tyr 



Gly He Ser Ser 

300 



Pro Lys Gin Met 
190 

Glu Gly Asn Arg 

205 

Asp Ser Leu Thr 



Ser Pro His Leu 
240 

Glu Arg He Glu 
255 

Lys Asn Leu Ser 
270 

Ala Pro His Val 

285 

Phe Glu Asp Leu 



<210> 16 
<211> 1101 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1098) 

<220> 

<221> sig_peptide 
<222> (1) . . (108) 

<400> 16 

atg att gtt ttt ctg gtt ttt aaa cat etc ttc age etc aga ttg att 48 

Met: He Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu He 
15 10 15 

aca atg ttc ttt eta ctt cat ttc ate gtt ctg ate aat gtc aaa gat 96 
Thr Met Phe Phe Leu Leu His Phe He Val Leu He Asn Val Lys Asp 

20 25 30 

ttt gca ctg act caa ggt age atg ate act cct tea tgc caa aaa gga 14 4 
Phe Ala Leu Thr Gin Gly Ser Met He Thr Pro Ser Cys Gin Lys Gly 
35 40 45 

tat ttt ccc tgt ggg aat ctt ace aag tgc tta ccc cga get ttt cac 192 
Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 
50 55 60 

tgt gat ggc aag gat gac tgt ggg aac ggg gcg gac gaa gag aac tgt 240 
Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 
65 70 75 80 
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ggt gac act agt gga tgg gcg acc ata ttt ggc aca gtg cat gga aat 288 
Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 
85 90 95 

get aac age gtg gec tta aca cag gag tgc ttt eta aaa cag tat cca 336 
Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

caa tgc tgt gac tgc aaa gaa act gaa ttg gaa tgt gta aat ggt gac 384 
Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 

tta aag tct gtg ccg atg att tct aac aat gtg aca tta ctg tct ctt 432 
Leu Lys Ser Val Pro Met He Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 

aag aaa aac aaa ate cac agt ctt cca gat aaa gtt ttc ate aaa tac 480 
Lys Lys Asn Lys He His Ser Leu Pro Asp Lys Val Phe He Lys Tyr 
145 150 155 160 

aca aaa ctt aaa aag ata ttt ctt cag cat aat tgc att aga cac ata 528 
Thr Lys Leu Lys Lys He Phe Leu Gin His Asn Cys He Arg His He 
165 170 175 

tec agg aaa gca ttt ttt gga tta tgt aat ctg caa ata tta tat etc 576 
Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin He Leu Tyr Leu 
180 185 190 

aac cac aac tgc ate aca acc etc aga cct gga ata ttc aaa gac tta 624 
Asn His Asn Cys He Thr Thr Leu Arg Pro Gly He Phe Lys Asp Leu 
195 200 205 

cat cag eta act tgg eta att eta gat gac aat cca ata acc aga att 672 
His Gin Leu Thr Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He 
210 215 220 

tea cag cgc ttg ttt acg gga tta aat tec ttg ttt ttc ctg tct atg 720 
Ser Gin Arg Leu Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met 
225 230 235 240 

gtt aat aac tac tta gaa get ctt ccc aag cag atg tgt gec caa atg 768 
Val Asn Asn Tyr Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met 
245 250 255 

cct caa etc aac tgg gtg gat ttg gaa ggc aat aga ata aag tat etc 816 
Pro Gin Leu Asn Trp Val Asp Leu Glu Gly Asn Arg He Lys Tyr Leu 
260 265 270 

aca aat tct acg ttt ctg teg tgc gat teg etc aca gtg ctg gat ctg 864 
Thr Asn Ser Thr Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu 
275 280 285 

tct age aat acg ata acg gag eta tea cct cac ctt ttt aaa gac ttg 912 
Ser Ser Asn Thr He Thr Glu Leu Ser Pro His Leu Phe Lys Asp Leu 
290 295 300 

aag ctt eta caa aag ctg aac ctg tea tec aat cct ctt atg tat ctt 960 
Lys Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro Leu Met Tyr Leu 
305 310 315 320 
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cac aag aac cag ttt gaa agt ctt aaa caa ctt cag tct eta gac ctg 1008 

Kis Lys Asn Gin Phe Glu Ser Leu Lys Gin Leu Gin Ser Leu Asp Leu 

325 330 335 

gaa agg ata gag att cca aat ata aac aca cga atg ttt caa ccc atg 1056 

Glu Arg lie Glu lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met 

340 345 350 

aag aat ctt tct cac ata gtt caa tat tat gat gtg ccg aca tga 1101 

Lys Asn Leu Ser His lie Val Gin Tyr Tyr Asp Val Pro Thr 

355 360 365 



<210> 17 
<211> 366 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Met lie Val Phe Leu Val Phe Lys His Leu Phe Ser Leu Arg Leu lie 
15 10 15 

Thr Met Phe Phe Leu Leu His Phe lie Val Leu lie Asn Val Lys Asp 
20 25 30 

Phe Ala Leu Thr Gin Gly Ser Met lie Thr Pro Ser Cys Gin Lys Gly 
35 40 45 

Tyr Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His 
50 55 60 

Cys Asp Gly Lys Asp Asp Cys Gly Asn Gly Ala Asp Glu Glu Asn Cys 
65 70 75 80 

Gly Asp Thr Ser Gly Trp Ala Thr lie Phe Gly Thr Val His Gly Asn 
85 90 95 

Ala Asn Ser Val Ala Leu Thr Gin Glu Cys Phe Leu Lys Gin Tyr Pro 
100 105 110 

Gin Cys Cys Asp Cys Lys Glu Thr Glu Leu Glu Cys Val Asn Gly Asp 
115 120 125 

Leu Lys Ser Val Pro Met lie Ser Asn Asn Val Thr Leu Leu Ser Leu 
130 135 140 

Lys Lys Asn Lys lie His Ser Leu Pro Asp Lys Val Phe lie Lys Tyr 
145 150 155 160 

Thr Lys Leu Lys Lys lie Phe Leu Gin His Asn Cys lie Arg His lie 
165 170 175 

Ser Arg Lys Ala Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu 
180 185 190 

Asn His Asn Cys lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu 
195 200 205 

His Gin Leu Thr Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie 
210 215 220 



35 



WO 02/14489 



PCT/US01/25054 



Ser Gin Arg Leu Phe 

225 

Val Asn Asn Tyr Leu 
245 

Pro Gin Leu Asn Trp 

260 

Thr Asn Ser Thr Phe 
275 

Ser Ser Asn Thr lie 
290 

Lys Leu Leu Gin Lys 
305 

His Lys Asn Gin Phe 
325 

Glu Arg lie Glu lie 
340 

Lys Asn Leu Ser His 
355 



Thr Gly Leu Asn Ser Leu 
230 235 

Glu Ala Leu Pro Lys Gin 
250 

Val Asp Leu Glu Gly Asn 
265 

Leu Ser Cys Asp Ser Leu 
280 

Thr Glu Leu Ser Pro His 
295 

Leu Asn Leu Ser Ser Asn 
310 315 

Glu Ser Leu Lys Gin Leu 
330 

Pro Asn lie Asn Thr Arg 
345 

lie Val Gin Tyr Tyr Asp 
360 



Phe Phe Leu Ser Met 

240 

Met Cys Ala Gin Met 

255 

Arg lie Lys Tyr Leu 

270 

Thr Val Leu Asp Leu 
285 

Leu Phe Lys Asp Leu 

300 

Pro Leu Met Tyr Leu 

320 

Gin Ser Leu Asp Leu 

335 

Met Phe Gin Pro Met 
350 

Val Pro Thr 

365 



<210> 18 
<211> 330 
<212> PRT 

<213> Homo sapiens 



<400> 18 
Gin Gly Ser Met 
1 

Gly Asn Leu Thr 
20 

Asp Asp Cys Gly 
35 

Gly Trp Ala Thr 
50 

Ala Leu Thr Gin 
65 

Cys Lys Glu Thr 



Pro Met lie Ser 
100 

lie His Ser Leu 
115 

Lys lie Phe Leu 
130 



lie Thr Pro Ser 
5 

Lys Cys Leu Pro 



Asn Gly Ala Asp 
40 

lie Phe Gly Thr 

55 

Glu Cys Phe Leu 
70 

Glu Leu Glu Cys 
85 

Asn Asn Val Thr 



Pro Asp Lys Val 
120 

Gin His Asn Cys 
135 



Cys Gin Lys Gly 
10 

Arg Ala Phe His 

25 

Glu Glu Asn Cys 



Val His Gly Asn 
60 

Lys Gin Tyr Pro 

75 

Val Asn Gly Asp 
90 

Leu Leu Ser Leu 

105 

Phe He Lys Tyr 



He Arg His He 
140 



Tyr Phe Pro Cys 
15 

Cys Asp Gly Lys 
30 

Gly Asp Thr Ser 
45 

Ala Asn Ser Val 



Gin Cys Cys Asp 

80 

Leu Lys Ser Val 
95 

Lys Lys Asn Lys 

110 

Thr Lys Leu Lys 
125 

Ser Arg Lys Ala 
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Phe Phe Gly Leu Cys Asn Leu Gin lie Leu Tyr Leu Asn His Asn Cys 
145 150 155 160 

lie Thr Thr Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Thr 
165 170 175 

Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Arg Leu 
180 185 190 

Phe Thr Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Asn Asn Tyr 

195 200 205 

Leu Glu Ala Leu Pro Lys Gin Met Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

Trp Val Asp Leu Glu Gly Asn Arg lie Lys Tyr Leu Thr Asn Ser Thr 

225 230 235 240 

Phe Leu Ser Cys Asp Ser Leu Thr Val Leu Asp Leu Ser Ser Asn Thr 
245 250 255 

lie Thr Glu Leu Ser Pro His Leu Phe Lys Asp Leu Lys Leu Leu Gin 
260 265 270 

Lys Leu Asn Leu Ser Ser Asn Pro Leu Met Tyr Leu His Lys Asn Gin 
275 280 285 

Phe Glu Ser Leu Lys Gin Leu Gin Ser Leu Asp Leu Glu Arg lie Glu 
290 295 300 

lie Pro Asn lie Asn Thr Arg Met Phe Gin Pro Met Lys Asn Leu Ser 
305 310 315 320 

His lie Val Gin Tyr Tyr Asp Val Pro Thr 
325 330 



<210> 19 

<211> 2214 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (2211) 

<220> 

<221> sig_peptide 
<222> (1) . . (57) 

<400> 19 

atg tgg etc eta ctt cat gtc ate ctt ctg aca gag gtc aaa gat ttt 48 

Met Trp Leu Leu Leu His Val lie Leu Leu Thr Glu Val Lys Asp Phe 

15 10 15 

gca ctg get gac age agt atg gtg get cct ctg tgc ccc aaa ggg tat 96 
Ala Leu Ala Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr 
20 25 30 

ttt ccc tgt ggg aat etc acc aaa tgc ttg ccc cga gee ttt cac tgc 144 
Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys 



37 



WO 02/14489 



PCT/U SO 1/25054 



35 40 45 

gat ggt gtg gat gat tgc ggg aat ggt gcc gac gag gac aac tgt ggt 192 

Asp Gly Val Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly 

50 55 60 

gac act agt gga tgg aca acc ata ttt ggc aca gtc cat ggg aat gtc 240 

Asp Thr Ser Gly Trp Thr Thr lie Phe Gly Thr Val His Gly Asn Val 
65 70 75 80 

aat aaa gtg aca ttg aca cag gag tgc ttt etc age cag tat cca cag 288 

Asn Lys Val Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin 
85 90 95 

cac tgt tac tgc aga gaa aat gaa ctg gaa tgt gta aag get gac tta 33 6 

His Cys Tyr Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu 
100 105 110 

aaa get gtg cca aag gtt tec age aac gta aca tta eta tct ctt aag 384 

Lys Ala Val Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys 
115 120 125 

aaa aac aaa ate cac aga ctt cca gtc aag gtc ttc age aga tac aca 432 

Lys Asn Lys He His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr 
130 135 140 

gaa etc aga aag ata tac ctt cag cac aac tgc ate aca cac ate tec 480 

Glu Leu Arg Lys He Tyr Leu Gin His Asn Cys He Thr His He Ser 
145 150 155 160 

agg aga gca ttc ctt gga tta cat aat eta caa ata ctg tat etc age 528 

Arg Arg Ala Phe Leu Gly Leu His Asn Leu Gin Tie Leu Tyr Leu Ser 
165 170 175 

cat aac tgc att acc tct etc agg cct ggg ata ttc aaa gac ttg cat 57 6 

His Asn Cys He Thr Ser Leu Arg Pro Gly He Phe Lys Asp Leu His 
180 185 190 

cag ctt get tgg eta att tta gat gac aac ccg ate acc aga ate tea 624 

Gin Leu Ala Trp Leu lie Leu Asp Asp Asn Pro He Thr Arg lie Ser 

195 200 205 

cag aag tec ttt atg ggg tta aac tec ttg ttt ttc ttg tec atg gtg 672 

Gin Lys Ser Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val 
210 215 220 

ggt aac egg etc gag gcc ctt cct gaa aca ttg tgt get cag atg cct 720 

Gly Asn Arg Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro 
225 230 235 240 

caa etc aac tgg gtg gat ctg gca aac aat gga ata aag tac ata acg 768 

Gin Leu Asn Trp Val Asp Leu Ala Asn Asn Gly He Lys Tyr He Thr 
245 250 255 

aac tec acc ttc eta acg tgc gac teg etc acg gtt ctg ttt ctg cct 816 

Asn Ser Thr Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro 
260 265 270 

aga aat caa att ggt ttt gtt cca gag aag aca ttt tct tea tta aaa 864 

Arg Asn Gin He Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys 
275 280 285 
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aat tta gga gaa ctg gac ctg tct age aat atg ata aca aaa etc cca 912 

Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Met He Thr Lys Leu Pro 

290 295 300 

gtc cac ctt ttc age gac ctt cat ctt etc cag aag ctg aac ctg tea 960 

Val His Leu Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser 

305 310 315 320 

tec aac cct ctt ctg tat gtc cac aag aac cag ttt gga agt etc aaa 1008 

Ser Asn Pro Leu Leu Tyr Val His Lys Asn Gin Phe Gly Ser Leu Lys 

325 330 335 



caa ctt cag tct eta gac ctg gaa agg ata gag att cca aac ata age 
Gin Leu Gin Ser Leu Asp Leu Glu Arg lie Glu lie Pro Asn He Ser 
340 345 350 



1056 



aca gga atg ttc cag cca atg aag aac ctt tct cac att tat ttg aaa 1104 

Thr Gly Met Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Leu Lys 
355 360 365 

acc ttt cga tac tgc tec tat gtc ccc cat gtc cga ate tgt atg ccg 1152 

Thr Phe Arg Tyr Cys Ser Tyr Val Pro His Val Arg He Cys Met Pro 

370 375 380 



teg act gat ggt att tct teg tct gag gac etc ttg get aac ggt ate 
Ser Thr Asp Gly He Ser Ser Ser Glu Asp Leu Leu Ala Asn Gly He 
385 390 395 400 

etc aga gtg tct gtc tgg gtt ata get ttc att acc tgc gtt ggg aat 
Leu Arg Val Ser Val Trp Val He Ala Phe He Thr Cys Val Gly Asn 
405 410 415 

ttc ctt gtc ata gee gtg aga tct etc att aag get gag aat aca act 
Phe Leu Val He Ala Val Arg Ser Leu He Lys Ala Glu Asn Thr Thr 
420 425 430 

cac get atg tec ate aaa ate ctt tgt tgt gee gat tgc ctg atg ggg 
His Ala Met Ser He Lys He Leu Cys Cys Ala Asp Cys Leu Met Gly 
435 440 445 

gtg tac ctg ttc tec gtg ggc gtc ttt gac ate aag tac cga ggg cag 
Val Tyr Leu Phe Ser Val Gly Val Phe Asp He Lys Tyr Arg Gly Gin 
450 455 460 

tat cag aag tat gcg ctg ctg tgg atg gag agt gtg ccc tgc cgc ctg 
Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Pro Cys Arg Leu 
465 470 475 480 

ctg ggc ttc ctg gec acg ctg tec aca gag gtc teg gtg ctg ctg ctg 
Leu Gly Phe Leu Ala Thr Leu Ser Thr Glu Val Ser Val Leu Leu Leu 
485 490 495 

aca ttc ctg acg ctg gag aag ttc ctt gtc ata gta ttc cct ttc age 
Thr Phe Leu Thr Leu Glu Lys Phe Leu Val He Val Phe Pro Phe Ser 
500 505 510 

aac ctg cgc ctg ggc aag cgc cag act get gtg gee etc gec age ate 
Asn Leu Arg Leu Gly Lys Arg Gin Thr Ala Val Ala Leu Ala Ser He 
515 520 525 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 
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tgg gtg gtg gga ttt etc ata gcg gec gtt ccg ttc acc aga gag gat 1632 

Trp Val Val Gly Phe Leu lie Ala Ala Val Pro Phe Thr Arg Glu Asp 
530 535 540 

tat ttc ggc aac ttt tat ggg aaa aat gga gtc tgc ttc cca ctt cat 1680 

Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu His 

545 550 555 560 

tat gac caa gca gaa gat ttt gga agt aga ggg tac tec ctt ggg att 1728 

Tyr Asp Gin Ala Glu Asp Phe Gly Ser Arg Gly Tyr Ser Leu Gly lie 

565 570 575 

ttc eta ggt gtg aac ttg ctg get ttc etc gtc ate gtg att tec tat 1776 

Phe Leu Gly Val Asn Leu Leu Ala Phe Leu Val lie Val lie Ser Tyr 

580 585 590 

gtc acc atg ttc tgc tec att cat aaa aca gec ctt cag act gca gaa 1824 

Val Thr Met Phe Cys Ser lie His Lys Thr Ala Leu Gin Thr Ala Glu 
595 600 605 



gtg agg age cac ate ggg aag gag gtg get gtt gca aac egg ttc ttt 
Val Arg Ser His lie Gly Lys Glu Val Ala Val Ala Asn Arg Phe Phe 
610 615 620 



187; 



ttt ate gtg ttc tct gat gec ate tgc tgg ate cct gtg ttt gtc gtt 1920 

Phe lie Val Phe Ser Asp Ala lie Cys Trp lie Pro Val Phe Val Val 

625 630 635 640 

aag ate ctg tct etc ctt caa gtg gag ata cca ggc aca ate act tec 1968 

Lys lie Leu Ser Leu Leu Gin Val Glu He Pro Gly Thr He Thr Ser 
645 650 655 

tgg ate gtg gtt ttt ttc ctt ccg gtg aac age gee tta aac ccc ate 2016 

Trp lie Val Val Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro He 
660 665 670 

etc tac act ctg acg acc tec ttt ttt aag gac aag ttg aaa cag ttg 2064 

Leu Tyr Thr Leu Thr Thr Ser Phe Phe Lys Asp Lys Leu Lys Gin Leu 

675 680 685 

ctg cac aaa cat egg agg aaa ccc ate ttc aaa gtt aag aag aaa agt 2112 

Leu His Lys His Arg Arg Lys Pro He Phe Lys Val Lys Lys Lys Ser 
690 695 700 

tta tec get tec att gtg tgg aca gac gag tct tea ctt aaa ctt gga 2160 

Leu Ser Ala Ser He Val Trp Thr Asp Glu Ser Ser Leu Lys Leu Gly 

705 710 715 720 

gtg ttg age aaa ata gec ctt ggg gac agt ata atg aag ccg gtc tec 2208 

Val Leu Ser Lys He Ala Leu Gly Asp Ser He Met Lys Pro Val Ser 
725 730 735 

ccg tag 2214 
Pro 



<210> 20 
<211>. 737 
<212> PRT 
<213> Mus 



rausculus 
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<400> 20 

Met Trp Leu Leu Leu His Val lie Leu Leu Thr Glu Val Lys Asp Phe 

15 10 15 

Ala Leu Ala Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr 
20 25 30 

Phe Pro Cys Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys 
35 40 45 

Asp Gly Val Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly 
50 55 60 

Asp Thr Ser Gly Trp Thr Thr lie Phe Gly Thr Val His Gly Asn Val 

65 70 75 80 

Asn Lys Val Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin 

85 90 95 

His Cys Tyr Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu 
100 105 110 

Lys Ala Val Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys 
115 120 125 

Lys Asn Lys lie His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr 
130 135 140 

Glu Leu Arg Lys lie Tyr Leu Gin His Asn Cys lie Thr His lie Ser 

145 150 155 160 

Arg Arg Ala Phe Leu Gly Leu His Asn Leu Gin lie Leu Tyr Leu Ser 
165 170 175 

His Asn Cys lie Thr Ser Leu Arg Pro Gly lie Phe Lys Asp Leu His 
180 185 190 

Gin Leu Ala Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser 
195 200 205 

Gin Lys Ser Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val 
210 215 220 

Gly Asn Arg Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro 
225 230 235 240 

Gin Leu Asn Trp Val Asp Leu Ala Asn Asn Gly lie Lys Tyr lie Thr 
245 250 255 

Asn Ser Thr Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro 
260 265 270 

Arg Asn Gin lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys 

275 280 285 

Asn Leu Gly Glu Leu Asp Leu Ser Ser Asn Met lie Thr Lys Leu Pro 

290 295 300 

Val His Leu Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser 
305 310 315 320 
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Ser Asn Pro Leu 



Gin Leu Gin Ser 
340 

Thr Gly Met Phe 
355 

Thr Phe Arg Tyr 
370 

Ser Thr Asp Gly 
385 

Leu Arg Val Ser 



Phe Leu Val lie 
420 

His Ala Met Ser 
435 

Val Tyr Leu Phe 
450 

Tyr Gin Lys Tyr 
465 

Leu Gly Phe Leu 



Thr Phe Leu Thr 
500 

Asn Leu Arg Leu 
515 

Trp Val Val Gly 
530 

Tyr Phe Gly Asn 
545 

Tyr Asp Gin Ala 



Phe Leu Gly Val 

580 

Val Thr Met Phe 

595 

Val Arg Ser His 
610 

Phe lie Val Phe 

625 

Lys lie Leu Ser 



Leu Tyr Val His 
325 

Leu Asp Leu Glu 



Gin Pro Met Lys 

360 

Cys Ser Tyr Val 
375 

lie Ser Ser Ser 
390 

Val Trp Val lie 
405 

Ala Val Arg Ser 



lie Lys lie Leu 
440 

Ser Val Gly Val 
455 

Ala Leu Leu Trp 
470 

Ala Thr Leu Ser 
485 

Leu Glu Lys Phe 



Gly Lys Arg Gin 

520 

Phe Leu lie Ala 
535 

Phe Tyr Gly Lys 
550 

Glu Asp Phe Gly 
565 

Asn Leu Leu Ala 



Cys Ser lie His 
600 

lie Gly Lys Glu 
615 

Ser Asp Ala lie 
630 

Leu Leu Gin Val 



Lys Asn Gin Phe 

330 

Arg lie Glu lie 
345 

Asn Leu Ser His 



Pro His Val Arg 

380 

Glu Asp Leu Leu 
395 

Ala Phe lie Thr 

410 

Leu lie Lys Ala 
425 

Cys Cys Ala Asp 



Phe Asp lie Lys 
4 60 

Met Glu Ser Val 
475 

Thr Glu Val Ser 
490 

Leu Val lie Val 
505 

Thr Ala Val Ala 



Ala Val Pro Phe 
540 

Asn Gly Val Cys 
555 

Ser Arg Gly Tyr 
570 

Phe Leu Val lie 
585 

Lys Thr Ala Leu 



Val Ala Val Ala 
620 

Cys Trp lie Pro 
635 

Glu lie Pro Gly 



Gly Ser Leu Lys 
335 

Pro Asn lie Ser 

350 

lie Tyr Leu Lys 
365 

lie Cys Met Pro 



Ala Asn Gly lie 
400 

Cys Val Gly Asn 
415 

Glu Asn Thr Thr 
430 

Cys Leu Met Gly 
445 

Tyr Arg Gly Gin 



Pro Cys Arg Leu 

480 

Val Leu Leu Leu 
495 

Phe Pro Phe Ser 
510 

Leu Ala Ser lie 
525 

Thr Arg Glu Asp 



Phe Pro Leu His 
560 

Ser Leu Gly lie 
575 

Val lie Ser Tyr 
590 

Gin Thr Ala Glu 
605 

Asn Arg Phe Phe 



Val Phe Val Val 
640 

Thr lie Thr Ser 
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645 

Trp lie Val Val Phe 
660 

Leu Tyr Thr Leu Thr 

675 

Leu His Lys His Arg 
690 

Leu Ser Ala Ser lie 
705 

Val Leu Ser Lys lie 
725 

Pro 



650 

Phe Leu Pro Val Asn 
665 

Thr Ser Phe Phe Lys 

680 

Arg Lys Pro lie Phe 
695 

Val Trp Thr Asp Glu 
710 

Ala Leu Gly Asp Ser 

730 



655 

Ser Ala Leu Asn Pro lie 

670 

Asp Lys Leu Lys Gin Leu 
685 

Lys Val Lys Lys Lys Ser 

700 

Ser Ser Leu Lys Leu Gly 
715 720 

lie Met Lys Pro Val Ser 

735 



<210> 21 
<211> 718 
<212> PRT 

<213> Mus musculus 
<400> 21 

Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr Phe Pro Cys 
15 10 15 

Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Val 
20 25 30 

Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly Asp Thr Ser 
35 40 45 

Gly Trp Thr Thr He Phe Gly Thr Val His Gly Asn Val Asn Lys Val 
50 55 60 

Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin His Cys Tyr 
65 70 75 80 

Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu Lys Ala Val 
85 90 95 

Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 HO 

He His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr Glu Leu Arg 
115 120 125 

Lys He Tyr Leu Gin His Asn Cys He Thr His He Ser Arg Arg Ala 
130 135 140 

Phe Leu Gly Leu His Asn Leu Gin He Leu Tyr Leu Ser His Asn Cys 
145 150 155 160 

He Thr Ser Leu Arg Pro Gly He Phe Lys Asp Leu His Gin Leu Ala 
165 170 175 

Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He Ser Gin Lys Ser 
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180 185 190 

Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Gly Asn Arg 

195 200 205 

Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

Trp Val Asp Leu Ala Asn Asn Gly lie Lys Tyr lie Thr Asn Ser Thr 
225 230 235 240 

Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 
245 250 255 

lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 
260 265 270 

Glu Leu Asp Leu Ser Ser Asn Met lie Thr Lys Leu Pro Val His Leu 
275 280 285 

Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 
290 295 300 

Leu Leu Tyr Val His Lys Asn Gin Phe Gly Ser Leu Lys Gin Leu Gin 
305 310 315 320 

Ser Leu Asp Leu Glu Arg lie Glu lie Pro Asn lie Ser Thr Gly Met 

325 330 335 

Phe Gin Pro Met Lys Asn Leu Ser His lie Tyr Leu Lys Thr Phe Arg 
340 345 350 

Tyr Cys Ser Tyr Val Pro His Val Arg lie Cys Met Pro Ser Thr Asp 
355 360 365 

Gly He Ser Ser Ser Glu Asp Leu Leu Ala Asn Gly He Leu Arg Val 
370 375 380 

Ser Val Trp Val lie Ala Phe He Thr Cys Val Gly Asn Phe Leu Val 
385 390 395 400 

He Ala Val Arg Ser Leu He Lys Ala Glu Asn Thr Thr His Ala Met 
405 410 415 

Ser He Lys lie Leu Cys Cys Ala Asp Cys Leu Met Gly Val Tyr Leu 
420 425 430 

Phe Ser Val Gly Val Phe Asp He Lys Tyr Arg Gly Gin Tyr Gin Lys 
435 440 445 

Tyr Ala Leu Leu Trp Met Glu Ser Val Pro Cys Arg Leu Leu Gly Phe 
450 455 460 

Leu Ala Thr Leu Ser Thr Glu Val Ser Val Leu Leu Leu Thr Phe Leu 
465 470 475 480 

Thr Leu Glu Lys Phe Leu Val He Val Phe Pro Phe Ser Asn Leu Arg 
485 490 495 

Leu Gly Lys Arg Gin Thr Ala Val Ala Leu Ala Ser He Trp Val Val 
500 505 510 
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Gly Phe Leu lie Ala Ala Val Pro Phe Thr Arg Glu Asp Tyr Phe Gly 
515 520 525 

Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro Leu His Tyr Asp Gin 
530 535 540 

Ala Glu Asp Phe Gly Ser Arg Gly Tyr Ser Leu Gly He Phe Leu Gly 
545 550 555 560 

Val Asn Leu Leu Ala Phe Leu Val He Val He Ser Tyr Val Thr Met 
565 570 575 

Phe Cys Ser He His Lys Thr Ala Leu Gin Thr Ala Glu Val Arg Ser 
580 585 590 

His He Gly Lys Glu Val Ala Val Ala Asn Arg Phe Phe Phe He Val 
595 600 605 

Phe Ser Asp Ala He Cys Trp He Pro Val Phe Val Val Lys He Leu 
610 615 620 

Ser Leu Leu Gin Val Glu He Pro Gly Thr He Thr Ser Trp He Val 
625 630 635 640 

Val Phe Phe Leu Pro Val Asn Ser Ala Leu Asn Pro He Leu Tyr Thr 
645 650 655 

Leu Thr Thr Ser Phe Phe Lys Asp Lys Leu Lys Gin Leu Leu His Lys 
660 665 670 

His Arg Arg Lys Pro He Phe Lys Val Lys Lys Lys Ser Leu Ser Ala 
675 680 685 

Ser He Val Trp Thr Asp Glu Ser Ser Leu Lys Leu Gly Val Leu Ser 
690 695 700 



Lys He Ala Leu Gly Asp Ser He Met Lys Pro Val Ser Pro 
705 710 715 



<210> 22 

<211> 1140 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (1140) 

<400> 22 

gac age agt atg gtg get cct ctg tgc ccc aaa ggg tat ttt ccc tgt 4 8 

Asp Ser Ser Met Val Ala Pro Leu Cys Pro Lys Gly Tyr Phe Pro Cys 

15 10 15 

ggg aat etc acc aaa tgc ttg ccc cga gec ttt cac tgc gat ggt gtg 96 
Gly Asn Leu Thr Lys Cys Leu Pro Arg Ala Phe His Cys Asp Gly Val 
20 25 30 

gat gat tgc ggg aat ggt gec gac gag gac aac tgt ggt gac act agt 14 
Asp Asp Cys Gly Asn Gly Ala Asp Glu Asp Asn Cys Gly Asp Thr Ser 
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35 40 45 

gga tgg aca acc ata ttt ggc aca gtc cat ggg aat gtc aat aaa gtg 192 

Gly Trp Thr Thr lie Phe Gly Thr Val His Gly Asn Val Asn Lys Val 

50 55 60 

aca ttg aca cag gag tgc ttt etc age cag tat cca cag cac tgt tac 240 

Thr Leu Thr Gin Glu Cys Phe Leu Ser Gin Tyr Pro Gin His Cys Tyr 

65 70 75 80 

tgc aga gaa aat gaa ctg gaa tgt gta aag get gac tta aaa get gtg 288 

Cys Arg Glu Asn Glu Leu Glu Cys Val Lys Ala Asp Leu Lys Ala Val 
85 90 95 

cca aag gtt tec age aac gta aca tta eta tct ctt aag aaa aac aaa 336 

Pro Lys Val Ser Ser Asn Val Thr Leu Leu Ser Leu Lys Lys Asn Lys 
100 105 110 

ate cac aga ctt cca gtc aag gtc ttc age aga tac aca gaa etc aga 384 

lie His Arg Leu Pro Val Lys Val Phe Ser Arg Tyr Thr Glu Leu Arg 
115 120 125 

aag ata tac ctt cag cac aac tgc ate aca cac ate tec agg aga gca 432 

Lys lie Tyr Leu Gin His Asn Cys lie Thr His lie Ser Arg Arg Ala 
130 135 140 

ttc ctt gga tta cat aat eta caa ata ctg tat etc age cat aac tgc 480 

Phe Leu Gly Leu His Asn Leu Gin lie Leu Tyr Leu Ser His Asn Cys 

145 150 155 160 

att acc tct etc agg cct ggg ata ttc aaa gac ttg cat cag ctt get 528 

lie Thr Ser Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Ala 
165 170 175 

tgg eta att tta gat gac aac ccg ate acc aga ate tea cag aag tec 576 

Trp Leu He Leu Asp Asp Asn Pro He Thr Arg He Ser Gin Lys Ser 
180 185 190 

ttt atg ggg tta aac tec ttg ttt ttc ttg tec atg gtg ggt aac egg 624 

Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Gly Asn Arg 
195 200 205 

etc gag gec ctt cct gaa aca ttg tgt get cag atg cct caa etc aac 672 

Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

tgg gtg gat ctg gca aac aat gga ata aag tac ata acg aac tec acc 720 
Trp Val Asp Leu Ala Asn Asn Gly He Lys Tyr He Thr Asn Ser Thr 

225 230 235 240 

ttc eta acg tgc gac teg etc acg gtt ctg ttt ctg cct aga aat caa 768 
Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 
245 250 255 

att ggt ttt gtt cca gag aag aca ttt tct tea tta aaa aat tta gga 816 
He Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 
260 265 270 

gaa ctg gac ctg tct age aat atg ata aca aaa etc cca gtc cac ctt 864 
Glu Leu Asp Leu Ser Ser Asn Met He Thr Lys Leu Pro Val His Leu 
275 280 285 
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ttc age gac ctt cat ctt etc cag aag ctg aac ctg tea tec aac cct 912 

Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 

290 295 300 

ctt ctg tat gtc cac aag aac cag ttt gga agt etc aaa caa ctt cag 960 

Leu Leu Tyr Val His Lys Asn Gin Phe Gly Ser Leu Lys Gin Leu Gin 

305 310 315 320 

tct eta gac ctg gaa agg ata gag att cca aac ata age aca gga atg 1008 

Ser Leu Asp Leu Glu Arg He Glu He Pro Asn He Ser Thr Gly Met 

325 330 335 

ttc cag cca atg aag aac ctt tct cac att tat ttg aaa ace ttt cga 1056 

Phe Gin Pro Met Lys Asn Leu Ser His He Tyr Leu Lys Thr Phe Arg 

340 345 350 

tac tgc tec tat gtc ccc cat gtc cga ate tgt atg ccg teg act gat 1104 

Tyr Cys Ser Tyr Val Pro His Val Arg He Cys Met Pro Ser Thr Asp 

355 360 365 

ggt att tct teg tct gag gac etc ttg get aac ggt 1140 

Gly He Ser Ser Ser Glu Asp Leu Leu Ala Asn Gly 

370 375 380 



<210> 23 
<211> 380 
<212> PRT 
<213> Mus muscu. 

<400> 23 
Asp Ser Ser Met 
1 

Gly Asn Leu Thr 
20 

Asp Asp Cys Gly 
35 

Gly Trp Thr Thr 
50 

Thr Leu Thr Gin 

65 

Cys Arg Glu Asn 



Pro Lys Val Ser 
100 

He His Arg Leu 
115 

Lys He Tyr Leu 
130 

Phe Leu Gly Leu 
145 



.us 

Val Ala Pro Leu 
5 

Lys Cys Leu Pro 

Asn Gly Ala Asp 
40 

He Phe Gly Thr 
55 

Glu Cys Phe Leu 
70 

Glu Leu Glu Cys 
85 

Ser Asn Val Thr 

Pro Val Lys Val 
120 

Gin His Asn Cys 
135 

His Asn Leu Gin 
150 



Cys Pro Lys Gly 
10 

Arg Ala Phe His 
25 

Glu Asp Asn Cys 



Val His Gly Asn 
60 

Ser Gin Tyr Pro 
75 

Val Lys Ala Asp 

90 

Leu Leu Ser Leu 

105 

Phe Ser Arg Tyr 



He Thr His He 
140 

lie Leu Tyr Leu 
155 



Tyr Phe Pro Cys 
15 

Cys Asp Gly Val 
30 

Gly Asp Thr Ser 
45 

Val Asn Lys Val 



Gin His Cys Tyr 
80 

Leu Lys Ala Val 
95 

Lys Lys Asn Lys 
110 

Thr Glu Leu Arg 
125 

Ser Arg Arg Ala 



Ser His Asn Cys 
160 
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lie Thr Ser Leu Arg Pro Gly lie Phe Lys Asp Leu His Gin Leu Ala 
165 170 175 

Trp Leu lie Leu Asp Asp Asn Pro lie Thr Arg lie Ser Gin Lys Ser 
130 185 190 

Phe Met Gly Leu Asn Ser Leu Phe Phe Leu Ser Met Val Gly Asn Arg 
195 200 205 

Leu Glu Ala Leu Pro Glu Thr Leu Cys Ala Gin Met Pro Gin Leu Asn 
210 215 220 

Trp Val Asp Leu Ala Asn Asn Gly lie Lys Tyr lie Thr Asn Ser Thr 

225 230 235 240 

Phe Leu Thr Cys Asp Ser Leu Thr Val Leu Phe Leu Pro Arg Asn Gin 
245 250 ■ 255 

lie Gly Phe Val Pro Glu Lys Thr Phe Ser Ser Leu Lys Asn Leu Gly 
260 265 270 

Glu Leu Asp Leu Ser Ser Asn Met lie Thr Lys Leu Pro Val His Leu 
275 280 285 

Phe Ser Asp Leu His Leu Leu Gin Lys Leu Asn Leu Ser Ser Asn Pro 
290 295 300 

Leu Leu Tyr Val His Lys Asn Gin Phe Gly Ser Leu Lys Gin Leu Gin 
305 310 315 320 

Ser Leu Asp Leu Glu Arg lie Glu lie Pro Asn lie Ser Thr Gly Met 
325 330 335 

Phe Gin Pro Met Lys Asn Leu Ser His lie Tyr Leu Lys Thr Phe Arg 
340 345 350 

Tyr Cys Ser Tyr Val Pro His Val Arg lie Cys Met Pro Ser Thr Asp 
355 360 365 



Gly lie Ser Ser Ser Glu Asp Leu Leu Ala Asn Gly 
370 375 380 



<210> 24 

<211> 757 

<212> PRT 

<213> Homo sapi< 

<400> 24 
Met Thr Ser Gly 
1 

Phe Ser His Gly 
20 

Pro Cys Gly Asn 
35 

Gly Val Asp Asp 
50 



sns 

Ser Val Phe Phe 

5 

Gly Gly Gin Asp 

lie Thr Lys Cys 
40 

Cys Gly Asn Gin 
55 



Tyr lie Leu lie 
10 

Val Lys Cys Ser 
25 

Leu Pro Gin Leu 



Ala Asp Glu Asp 

60 



Phe Gly Lys Tyr 
15 

Leu Gly Tyr Phe 
30 

Leu His Cys Asn 
45 

Asn Cys Gly Asp 
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Asn Asn Gly Trp Ser Met Gin Phe Asp Lys Tyr Phe Ala Ser Tyr Tyr 
65 70 75 80 

Lys Met Thr Ser Gin Tyr Pro Phe Glu Ala Glu Thr Pro Glu Cys Leu 
85 90 95 

Val Gly Ser Val Pro Val Gin Cys Leu Cys Gin Gly Leu Glu Leu Asp 
100 105 110 

Cys Asp Glu Thr Asn Leu Arg Ala Val Pro Ser Val Ser Ser Asn Val 
115 120 125 

Thr Ala Met Ser Leu Gin Trp Asn Leu lie Arg Lys Leu Pro Pro Asp 
130 135 140 

Cys Phe Lys Asn Tyr His Asp Leu Gin Lys Leu Tyr Leu Gin Asn Asn 
145 150 155 160 

Lys lie Thr Ser lie Ser lie Tyr Ala Phe Arg Gly Leu Asn Ser Leu 
165 170 175 

Thr Lys Leu Tyr Leu Ser His Asn Arg lie Thr Phe Leu Lys Pro Gly 
180 185 190 

Val Phe Glu Asp Leu His Arg Leu Glu Trp Leu lie lie Glu Asp Asn 
195 200 205 

His Leu Ser Arg lie Ser Pro Pro Thr Phe Tyr Gly Leu Asn Ser Leu 
210 215 220 

lie Leu Leu Val Leu Met Asn Asn Val Leu Thr Arg Leu Pro Asp Lys 

225 230 235 240 

Pro Leu Cys Gin His Met Pro Arg Leu His Trp Leu Asp Leu Glu Gly 
245 250 255 

Asn His lie His Asn Leu Arg Asn Leu Thr Phe He Ser Cys Ser Asn 
260 265 270 

Leu Thr Val Leu Val Met Arg Lys Asn Lys He Asn His Leu Asn Glu 
275 280 285 

Asn Thr Phe Ala Pro Leu Gin Lys Leu Asp Glu Leu Asp Leu Gly Ser 
290 295 300 

Asn Lys He Glu Asn Leu Pro Pro Leu He Phe Lys Asp Leu Lys Glu 
305 310 315 320 

Leu Ser Gin Leu Asn Leu Ser Tyr Asn Pro He Gin Lys He Gin Ala 
325 330 335 

Asn Gin Phe Asp Tyr Leu Val Lys Leu Lys Ser Leu Ser Leu Glu Gly 
340 345 350 

He Glu He Ser Asn He Gin Gin Arg Met Phe Arg Pro Leu Met Asn 
355 360 365 

Leu Ser His He Tyr Phe Lys Lys Phe Gin Tyr Cys Gly Tyr Ala Pro 
370 375 380 
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His Val Arg Ser 
385 

Asn Leu Leu Ala 



Ala Val Thr Cys 
420 

lie Arg Ser Glu 
435 

Cys Ala Asp Cys 
450 

Asp Leu Lys Phe 
465 

Glu Ser Thr His 



Glu Val Ser Val 
500 

Cys lie Val Tyr 
515 

lie Thr Val Leu 
530 

lie Pro Leu Ser 
545 

Gly Val Cys Phe 



Gin lie Tyr Ser 
580 

lie lie lie Val 

595 

Ser Ala lie Thr 

610 

lie Leu Ala Lys 

625 

Trp lie Pro lie 



He Pro Gly Thr 
660 

Asn Ser Ala Leu 
675 

Lys Glu Met He 
690 

Met Asp Ser Lys 



Cys Lys Pro Asn 
390 

Ser He He Gin 
405 

Phe Gly Asn He 



Asn Lys Leu Tyr 
440 

Leu Met Gly He 
455 

Arg Gly Glu Tyr 
470 

Cys Gin Leu Val 
485 

Leu Leu Leu Thr 



Pro Phe Arg Cys 
520 

He Leu He Trp 
535 

Asn Lys Glu Phe 
550 

Pro Leu His Ser 
565 

Val Ala He Phe 



Phe Ser Tyr Gly 
600 

Ala Thr Glu He 
615 

Arg Phe Phe Phe 
630 

Phe Val Val Lys 
645 

He Thr Ser Trp 



Asn Pro He Leu 
680 

His Arg Phe Trp 
695 

Gly Gin Lys Thr 



Thr Asp Gly lie 
395 

Arg Val Phe Val 
410 

Phe Val He Cys 
425 

Ala Met Ser He 



Tyr Leu Phe Val 
460 

Asn Lys His Ala 
475 

Gly Ser Leu Ala 
490 

Phe Leu Thr Leu 
505 

Val Arg Pro Gly 



He Thr Gly Phe 
540 

Phe Lys Asn Tyr 
555 

Glu Asp Thr Glu 

570 

Leu Gly He Asn 
585 

Ser Met Phe Tyr 



Arg Asn Gin Val 
620 

He Val Phe Thr 
635 

Phe Leu Ser Leu 
650 

Val Val He Phe 
665 

Tyr Thr Leu Thr 



Tyr Asn Tyr Arg 
700 

Tyr Ala Pro Ser 



Ser Ser Leu Glu 
400 

Trp Val Val Ser 
415 

Met Arg Pro Tyr 
430 

He Ser Leu Cys 
445 

He Gly Gly Phe 



Gin Leu Trp Met 

480 

lie Leu Ser Thr 
495 

Glu Lys Tyr He 
510 

Lys Cys Arg Thr 
525 

He Val Ala Phe 



Tyr Gly Thr Asn 
560 

Ser lie Gly Ala 
575 

Leu Ala Ala Phe 
590 

Ser Val His Gin 
605 

Lys Lys Glu Met 



Asp Ala Leu Cys 
640 

Leu Gin Val Glu 
655 

He Leu Pro lie 
670 

Thr Arg Pro Phe 
685 

Gin Arg Lys Ser 



Phe He Trp Val 
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705 710 715 720 

Glu Met Trp Pro Leu Gin Glu Met Pro Pro Glu Leu Met Lys Pro Asp 
725 730 735 

Leu Phe Thr Tyr Pro Cys Glu Met Ser Leu lie Ser Gin Ser Thr Arg 
740 745 750 

Leu Asn Ser Tyr Ser 
755 



<210> 25 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 25 

Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
15 10 



<210> 26 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: internalizing 
domain derived from HIV tat protein 

<400> 26 

Gly Gly Gly Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
15 10 15 



<210> 27 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 27 

tgccaaaaag gatattttcc ctgtgggaat ctta 34 



<210> 28 
<211> 47 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 28 

ctaggaaact ggtttcatta tactgtctcc aagtgttatt ttgttca 47 



<210> 29 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: RACE primer 
<400> 29 

ccatcctaat acgactcact atagggc 27 



<210> 30 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: RACE primer 



<400> 30 

attgtcatct agaattagcc aagttagctg at 



<210> 31 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



32 



<220> 

<223> Description of Artificial Sequence: RACE primer 
<400> 31 

aacaaggaat ttaatcccgt aaacaag 



<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: RACE primer 
<400> 32 

actcactata gggctcgagc ggc 



<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: RACE primer 
<400> 33 

atattccagg tctgagggtt gtgat 



<210> 34 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: RACE primer 
<400> 34 

atattccagg tctgagggtt gtgat 



<210> 35 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 35 

ctgctttgga aatctttttg tea 



<210> 36 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 36 

ttttccaggt cgaatgttac tga 



<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 37 

atgecttget gtggatggag 



<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 38 

acttcggtgg acagcatgg 



<210> 39 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide probe 

<400> 39 

cgtgcagtgc cgcctcatgg 20 

<210> 40 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<210> 41 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 41 

tctttgggac cttgtctgca a 21 



<210> 42 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide probe 



<400> 40 

gtcgacggcg agccc 



15 



<400> 42 

tgggccgcgt ctcctttgag ct 



22 
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